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Ireland. 

do hereby declare this invention to be described in the following statement: 



NEW SEQUENCES FOR ALTERING MAMMALIAN OVARIAN FUNCTION 
AND OVULATION RATE 



Technical Field 

The present invention relates to new sequences for altering mammalian ovarian 
5 function and ovulation rate. 

In particular, the invention broadly concerns a novel mutation in the GDF 9 gene and 
two novel mutations in the GDF 9B gene. These mutations have been found to be 
involved in increasing the ovulation rate in heterozygous female mammals; or causing 
sterility in homozygous female mammals. 

10 Background Art 

The genes GDF9 and GDF9B (also known as BMP 15) code for proteins which are 
expressed exclusively in the oocyte of the developing follicle, and which play an 
essential role in mammalian fertility. GDF9 is a member of the transforming growth 
factor beta (TGF0) superfamily (McPherron and Lee, 1993) which is expressed in 

15 oocytes from the primary stage of follicular development until ovulation (McGrath et 
aL, 1995; Laitinen et aL, 1998). GDF9B is closely related to GDF9 (Dube et aL, 1998; 
Laitinen et aL, 1998) and is expressed in mouse oocytes at the same time as GDF9, but 
in human primary follicles slightly later than GDF9. In the ovary GDF9 and GDF9B 
have now been shown to be expressed exclusively in the developing oocyte in humans 

20 (Aaltonen et aL, 1999), rodents (Laitinen et aL, 1998; Dube et aL, 1998; Jaatinen et aL, 
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1999) , ruminants (Bodensteiner et aL, 1999; Bodensteiner et aL, 2000; Galloway et aL, 

2000) and marsupials (Eckery et aL, 2002). In sheep expression of GDF9 can be seen 
in primordial follicles whereas GDF9B is expressed in primary follicles (Bodensteiner 
etaL, 1999; Galloway etaL, 2000). 

25 GDF9 is an essential growth factor for folliculogenesis in mice. Female GDF9 
knockout mice (GDF9 -A) are infertile due to a block in follicular development at the 
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primary stage (Dong et aL, 1996). GDF9B does not appear to be crucial for mouse 
folliculogenesis as knockout female mice (BMP15 -A) are fertile (Yan et aL, 2001), 
even though fecundity is somewhat reduced. However, GDF9B is essential for 
folliculogenesis in sheep as those carrying two copies of naturally-occurring 
inactivating GDF9B mutations are infertile due to a block in follicular development at 
the primary stage (Galloway et aL, 2000). 

In sheep it is also clear that heterozygotes carrying inactivating mutations in one copy 
of GDF9B (whereby only one copy of the gene produces active protein) have an 
increased ovulation rate (Galloway et aL, 2000). A similar increase in ovulation rate in 
heterozygote mice with knockouts in either GDF9 or GDF9B has not been observed 
(Yan et al, 2001). Double knockouts of both GDF9 and GDF9B in mice are infertile 
with a similar phenotype to GDF9 -/- mice alone, but GDF9B knockout mice (BMP 15 
-/-) with one active copy of GDF9, have a lower fecundity than BMP 15 females 
(Yan et aL, 2001), suggesting that the relative dose of these gene products may also 
play a role in mice. Collectively these findings suggest that important differences exist 
in the actions of GDF9 and GDF9B between species with a high ovulation rate 
phenotype (e.g. mice, rats) and those with a low ovulation rate phenotype (e.g. sheep, 
humans). 

GDF9 maps to a region of sheep chromosome 5 (Sadighi et al., 2002) which is 
syntenic to the map locations for GDF9 on human chromosome 5 and mouse 
chromosome 11 (Mouse Genome Informatics (2002). GDF9B maps to the sheep X 
chromosome (Galloway et al., 2000) in a region of the chromosome syntenic to the 
map locations for GDF9B on the human and mouse X chromosomes (Dube et aL, 
1998; Aaltonen et al, 1999). It has also recently been mapped to the pig X 
chromosome pi 1-p 13 region (Grapes and Rothschild, 2002) 

GDF9 and GDF9B, like other members of the TGFp family,' are coded as 
prepropeptides containing a signal peptide, a proregion and a C-terminal mature region 



which is the biologically active peptide. Cleavage of the mature region from the 
proregion is carried out by an intracellular fiirin-like protease, and occurs at a 
conserved furin protease cleavage site. Members of the TGFp superfamily are 
biologically active as dimers, and although GDF9 and GDF9B do not contain the 
cysteine molecule responsible for covalent interchain disulphide bonding seen in other 
members of the family, these molecules are thought to be biologically active as dimers 
(Galloway et al. 9 2000; Yan et al., 2001). However it is unclear whether the 
physiologically active dimers are homodimers (GDF9-GDF9 and GDF9B-GDF9B), or 
heterodimers (GDF9-GDF9B) or whether all three dimer forms play a role. It has been 
postulated based on the above models that GDF9 homodimers play a more important 
role in the mouse but in sheep the GDF9B homodimers are the most bioactive (Yan et 
ah, 2001). It is unclear whether any such difference is related to the fact that sheep are 
mono-ovulatory animals (maturing usually only one egg per cycle) whereas mice are 
poly-ovulatory. Clearly both GDF9 and GDF9B play crucial roles in controlling and 
maintaining fertility in mammals, and understanding the nature of their actions is 
essential for the development of therapies. 

GDF9 and GDF9B in sheep 

The sheep GDF9 gene spans about 2.5 kb and contains 2 exons separated by a single 
1126 bp intron (Bodensteiner et a/., 1999). The full length coding sequence is 1359 
nucleotides long and encodes a pre-propeptide of 453 amino acid residues (Genbank 
accession number AF078545). A pre-pro region of 318 residues includes a predicted 
signal sequence, and ends with the RHRR furin protease cleavage site at residues 315- 
318. Residues 319 to 453 beyond the cleavage site code for the 135 amino acid mature 
active peptide. The amino acid sequence of the sheep GDF9 mature peptide is 92.8 % 
similar to the human mature peptide and 87.1 % similar to the mouse mature peptide. 

Sheep GDF9B has previously been sequenced by us (Galloway et ah 2000; Genbank 
accession nos. AF236078, AF236079) and has a very similar gene structure to GDF9. 



The full length coding sequence of 1179 nucleotides is contained in two exons, 
separated by an intron of about 5.4 kb, and encodes a pre-propeptide of 393 amino acid 
residues. A pre-pro region of 268 residues includes a predicted signal sequence, and 
ends with the RRAR furin protease cleavage site at residues 265 - 268. Residues 269 
to 393 beyond the cleavage site code for a 125 amino acid mature active peptide. The 
amino acid sequence of the sheep GDF9B mature peptide is 78.3 % similar to the 
human mature peptide and 78.6 % similar to the mouse mature peptide. 

We have previously shown that the effects on prolificacy in Inverdale and Hanna sheep 
is due to naturally-occurring mutations in GDF9B (Galloway et aL, 2000). Both 
Inverdale and Hanna sheep have increased ovulation rates in heterozygous carriers of 
mutated GDF9B, but female homozygous carriers are infertile with 'streak' ovaries 
(Davis et al. 9 2001). Infertility in these sheep is due to primary ovarian failure caused 
by the inability of the follicle to develop beyond the primary stage. Hanna sheep have a 
single C to T mutation at nucleotide 871 of the GDF9B coding sequence (nucleotide 
67 of the mature GDF9B peptide coding region) which produces a premature stop 
codon in the place of a glutamic acid (Q) at amino acid residue 291 (residue 23 of the 
mature protein). Inverdale sheep have a distinct T to A mutation at nucleotide 896 
(nucleotide 92 of the mature GDF9B peptide coding region) which substitutes valine 
(V) for aspartic acid (D) at residue 299 (residue 31 of the mature peptide). This 
substitution of a hydrophobic valine with a negatively charged aspartate changes the 
electrostatic surface potentials of an area involved in dimer formation and appears to 
disrupt dimerisation and hence abolish biological activity (Galloway et al 9 2000). 

In addition to the Inverdale and Hanna lines of sheep discussed above, the Cambridge 
and F700 Belclare strains of sheep have also been shown to carry genes affecting 
prolificacy as evidenced in high ovulation rate (Hanrahan & Owen, 1985; Hanrahan, 
1991) and the presence of sterile ewes with 'streak-like' ovaries (Hanrahan, 1991; 
Hanrahan, 1996). 



The Cambridge breed was established at the Cambridge University farm in 1964 by 
screening 54 ewes selected for their high prolificacy from nine British sheep breeds 
(Owen, 1991). Ewes within the screened flock were subsequently selected on high 
litter size. Ewes with the highest ovulation rates were selected from this flock in 1984 
5 to provide the foundation animals for the flock now maintained at Teagasc Sheep 
Research Centre in Ireland (Hanrahan, 1991). A progeny test of 10 Cambridge rams, 
descended from the flock in Ireland, gave progeny mean ovulation rates ranging from 
2.1 - 4.2 (Hanrahan, 1996). 

The Belclare breed was established in 1978 at the Belclare Research Centre of Teagasc 
10 in Ireland by crossing three populations of prolific sheep assembled by Teagasc in 
Ireland. These were Fingalway, High Fertility, and Lleyn sheep (Hanrahan, 1989; 
Hanrahan, 1991). The Fingalway was an interbred cross (from Fl) of the Finnish 
Landrace and Galway breeds; the Lleyn is a breed native to north west Wales and 
selected animals were imported into Ireland in 1975 by Teagasc for the purposed of 
15 developing the Belclare breed; the High Fertility was developed in Ireland during the 
1960s from ewes with exceptional litter size performance collected from farms in 
Ireland between 1963 and 1965. The details of the breed composition of the 
foundation animals for High Fertility line were given by Hanrahan (1984). A subline 
of the Belclare (called F700 line) was derived from Belclare sheep that had 
20 exceptionally high ovulation rates (Hanrahan 1991). Progeny of 10 Belclare rams had 
mean ovulation rates ranging from 1.9-4.2 (Hanrahan, 1996). 

We describe here new naturally-occurring mutations in sheep GDF9 and GDF9B. We 
show for the first time that mutation of GDF9 causes increased ovulation rate as well 
as infertility in a manner similar to inactivating mutations in GDF9B, and that GDF9 is 
25 also essential for maintaining normal ovarian folliculogenesis in sheep. Furthermore, 
we show, for the first time in any species, that sheep which are heterozygous for both 
GDF9 and GDF9B mutations have higher ovulation rates than sheep that are 
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heterozygous for GDF9 or GDF9B mutations alone ; these observations are supported 
by genotype, phenotype and immunisation data. 



All references, including any patents or patent applications cited in this specification 
are hereby incorporated by reference. No admission is made that any reference 
5 constitutes prior art. The discussion of the references states what their authors assert, 
and the applicants reserve the right to challenge the accuracy and pertinency of the 
, cited documents. It will be clearly understood that, although a number of prior art 
publications are referred to herein, this reference does not constitute an admission that 
any of these documents form part of the common general knowledge in the art, in New 
10 Zealand or in any other country. 

It is acknowledged that the term 'comprise' may, under varying jurisdictions, be 
attributed with either an exclusive or an inclusive meaning. For the purpose of this 
specification, and unless otherwise noted, the term 'comprise' shall have an inclusive 
meaning - i.e. that it will be taken to mean an inclusion of not only the listed 
15 components it directly references, but also other non-specified components or 
elements. This rationale will also be used when the term 'comprised' or Comprising 1 is 
used in relation to one or more steps in a method or process. 

It is an object of the present invention to address the foregoing problems or at least to 
provide the public with a useful choice. 

20 Further aspects and advantages of the present invention will become apparent from the 
ensuing description which is given by way of example only. 

Disclosure of Invention 

The present invention is concerned with novel mutated GDF 9 and GDF 9B gene 
sequences which alter mammalian ovarian function and ovulation rate. The invention 
25 broadly has application in increasing or decreasing the ovulation rate, or causing 
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sterility in a female mammal, and additionally encompasses regulation of the function 
ofthecorpeus luteum. 

In particular, the present invention concerns a novel mutation in GDF 9 which 
increases ovulation rate in heterozygotes and causes sterility in homozygotes for this 
5 gene. 

The present invention also concerns two mutations in GDF 9B which increase the 
ovulation rate in heterozygotes for either, but not both together, mutations of the GDF 
9B gene. Mammals which are heterozygotes for both mutations in GDF 9B (where 
each mutation is on a separate X chromosome) are sterile. 

10 The inventors have also discerned that in female mammals that are heterozygous for 
. the mutated GDF 9 gene and heterozygous for one, but not both, of the GDF 9B gene 
mutations, an even higher ovulation rate exists than in animals heterozygous for one 
mutation in either GDF 9 or GDF 9B. 

Knowledge of the mutated gene sequences can be applied to a test for identifying 
15 heterozygous or homozygous female and male mammals carrying the mutated gene. 
This knowledge of the biological function of the gene and its mutations can also be 
utilised to increase or decrease the ovulation rate of female mammals, or to induce 
sterility or reduced fertility in female mammals. 

According to a first aspect of the present invention there is provided an isolated 
20 mutated GDF 9 nucleic acid molecule comprising a nucleotide sequence selected from 
the group consisting of: 

a) SEQIDNOs. 1,3 or 5; 

b) a sequence complementary to the molecule defined in a); 

c) a functional fragment or variant of the sequences in a) or b); 

■ 
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d) an anti-sense sequence to any of the molecules defined in a), b)or c). 

Throughout this specification it should understood that the nucleic acid molecule may 
be a RNA, cRNA, genomic DNA Or cDNA molecule, and may be single or double- 
stranded. The nucleic acid molecule may also optionally comprise one or more 
5 synthetic non-natural or altered nucleotide bases, or combinations thereof. 

According to a second aspect of the present invention there is provided an isolated 
mutated GDF 9B nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

■ 

a) SEQIDNOs. 7,9,11,13,15 or 17; 

10 b ) a sequence complementary to the molecule defined in a) 

c) an anti-sense sequence to any of the molecules defined in a) or b). 

According to a third aspect of the present invention there is provided an isolated 
mutated GDF 9 polypeptide comprising an amino acid sequence selected from the 

i 

group consisting of: 
15 a) SEQ ID NOs. 2,4 or 6; 

b) a functional fragment or variant of the sequences in a). 

According to a fourth aspect of the present invention there is provided an isolated 
mutated GDF 9B polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOs. 8,10,12,14,16 or 18. 

20 According to a fifth aspect of the present invention there is provided an isolated GDF 9 
nucleic acid molecule comprising a nucleotide sequence consisting of a mutated 
nucleotide sequence which encodes a non-conservative amino acid substitution, in the 
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GDF 9 polypeptide, in at least one codon of the nucleotide sequence associated with 
receptor binding. 

According to a sixth aspect of the present invention there is provided an isolated GDF 
9 nucleic acid molecule comprising a nucleotide sequence consisting of a mutated 
nucleotide sequence which encodes a non-conservative amino acid substitution, in the 
GDF 9 polypeptide, in at least one codon of the nucleotide sequence associated with 
dimerisation of the encoded peptide. 

According to a seventh aspect of the present invention there is provided an isolated 
GDF 9B nucleic acid molecule comprising a nucleotide sequence consisting of a 
mutated nucleotide sequence which encodes a non-conservative amino acid 
substitution occurring in at least one codon of the nucleotide sequence associated with 
receptor binding; wherein said nucleotide sequence does not comprise SEQ ID NO. 5 
disclosed in WO 01/85926. 

According to an eighth aspect of the present invention there is provided an isolated 
GDF 9B nucleic acid molecule comprising a nucleotide sequence consisting of a 
mutated nucleotide sequence which encodes a non-conservative amino acid 
substitution occurring in at least one codon of the nucleotide sequence associated with 
dimerisation of the peptide; wherein said nucleotide sequence does not comprise SEQ 
ID NO. 5 disclosed in WO 01/85926. 

Suitable programs for ascertaining the structure of polypeptides from the amino acid 
sequence which can used to determine the regions of the nucleotide sequence 
associated with dimerisation and/or receptor binding will be known to persons skilled 
in the art. Examples of suitable computer programs include The Modeller by 
Rockerfeller University and The SWISS Model developed by Swiss Protein database. 
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According to a ninth aspect of the present invention there is provided a method of 
identifying an mammal which carries a mutated nucleic acid molecule encoding GDF- 
9B, said method comprising the steps of: 

i) obtaining a tissue or blood sample from the mammal; 

ii) isolating DNA from the sample; and optionally 

iii) isolating GDF-9B DNA from the DNA obtained at step i) or ii); 

iv) probing said DNA with a probe complementary to either strand of the 
mutated GDF 9B DNA of SEQ ID NOs 1 1 or 17; 

v) amplifying the amount of mutated GDF 9B DNA; 

vi) determining whether the GDF 9B sequence DNA obtained in step v) carries 
a mutation associated with sterility or increased ovulation. 

According to a tenth aspect of the present invention there is provided a method of 
identifying an mammal which carries a mutated nucleic acid molecule encoding GDF- 
9, said method comprising the steps of: 

i 

i) obtaining a tissue or blood sample from the mammal; 

9 

ii) isolating DNA from the sample; and optionally 

iii) isolating GDF-9 DNA from the DNA obtained at step i) or ii); 

iv) probing said DNA with a probe complementary to either strand of the 
mutated GDF 9 DNA of SEQ ID NO 5; 

v) amplifying the amount of mutated GDF 9 DNA; 

vi) determining whether the GDF 9 sequence DNA obtained in step v) carries a 
mutation associated with sterility or increased ovulation. 
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According to an eleventh aspect of the present invention there is provided the use of a 
nucleic acid molecule which is complementary to either strand of the mutated DNA of 
SEQ ID NOs. 1 1 or 17 as a marker to identify a mammal carrying a mutated nucleic 
"acid molecule encoding GDF-9B. 

The term 'either strand' refers to both the strand of DNA shown in the Sequence ID 
Number that is being referred to or its complementary strand which is not shown in the 
sequence listing but which can be determined therefrom. 

According to a twelfth aspect of the present invention there is provided the use of a 
marker as claimed above wherein the identification of mammals carrying said mutated 
GDF-9B is for DNA assisted selection of mammals that either have enhanced 
ovulation or sterility. 

According to a thirteenth aspect of the present invention there is provided the use of a 
nucleic acid molecule which is complementary to either strand of the mutated DNA of 
SEQ ID NO 5 as a marker to identify a mammal carrying a mutated nucleic acid 
molecule encoding GDF-9. 

According to a fourteenth aspect of the present invention there is provided the use of a 
marker as claimed above wherein the identification of mammals carrying said mutated 
GDF-9 is for DNA assisted selection of mammals that either have enhanced ovulation 
or sterility. 

According to a fifteenth aspect of the present invention there is provided a probe 
capable of specifically hybridising to either strand of the mutated GDF 9B DNA of 
SEQ ID NOs 11 or 17. 
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According to a sixteenth aspect of the present invention there is provided a probe 
capable of specifically hybridising to either strand of the mutated GDF 9 DNA of SEQ 
ID NO 5. 

According to a seventeenth aspect of the present invention there is provided a construct 
comprising a nucleic acid molecule substantially as described above. 

According to an eighteenth aspect of the present invention there is provided a vector 
comprising a nucleic acid molecule substantially as described above. 

According to a nineteenth aspect of the present invention there is provided a host cell 
which comprises a construct or vector substantially as described above which has been 
introduced therein. 

The term "host cell" refers to a cell which is capable of containing a vector or 
construct and supports the replication and/or expression of the vector or construct. 
Suitable hosts cells may include E.coli, yeast or mammalian cells but should not be 
limited thereto. 

15 According to a twentieth aspect of the present invention there is provided a cell line 
comprising a host cell substantially as described above. 

According to a twenty-first aspect of the present invention there is provided a method 
altering the GDF 9 and/or GDF 9B polypeptide composition of a female mammal so as 
to modulate ovulation comprising the steps of introducing to the genetic material of the 
20 mammal at least one nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

- SEQ ID NOs 1 or a functional fragment or variant thereof ; and 

- SEQ ID NOs 7 or 13 but not both; or 

- SEQ ID NOs 3 or a functional fragment or variant thereof; and 
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- SEQ ID NOs 9 or 15 but not both. 

According to a twenty-second aspect of the present invention there is provided the 
method above wherein the nucleic acid molecule is introduced by a vector or construct. 

According to a twenty-fourth aspect of the present invention there is provided a 
method of modulating the ovulation of a female mammal comprising the steps of: 

a) identifying the nucleotide sequences of GDF 9 or GDF 9B carried by the 
female mammal; 

b) administering as appropriate to a mammal depending on the GDF 9 or GDF 
9B genes present in the mammal a partial immunisation of a composition 
comprising: 

i) a GDF 9 polypeptide or a functional fragment or variant of 
GDF9; and/or 

ii) a GDF 9B polypeptide or a functional fragment or variant of 
GDF 9B; and/or 

iii) a GDF 9 polypeptide and a GDF 9B polypeptide or functional 
fragment or variant of GDF 9 or GDF 9B. 

together with a pharmaceutically or veterinarily acceptable 
carrier and/or diluent; 

so as to effectively modulate the ovulation of the mammal. 

According to a twenty-fifth aspect of the present invention there is provided the 
method above wherein the carrier is or includes a mild adjuvant. 
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The term "mild adjuvant" means an adjuvant that induces a moderate immune 
response. By way of contrast the term "severe adjuvant" means an adjuvant that 
induces strong immune response. 

According to a twenty-six aspect of the present invention there is provided a method of 
modulating the ovulation rate of a female mammal comprising the steps of: 

a) identifying the nucleotide sequences of GDF 9 or GDF 9B carried by the 
female mammal; 

b) administering as appropriate having regard to the GDF 9 and/or GDF 9B 
genes present in the mammal, an effective amount of an agent selected from the 
group consisting of: 

1) an immunising effective amount of a GDF 9 polypeptide and/or 
an immunising effective amount of a GDF 9B polypeptide 
substantially as described above; 

2) antisense nucleic acid molecule(s) directed towards nucleic 
acid(s) encoding: 

i) a GDF 9 polypeptide substantially as described above; and/or • 

ii) a GDF 9B polypeptide substantially as described above. 

According to a twenty-seventh aspect of the present invention there is provided a 
method for breeding a mammal having increased ovulation comprising the steps of: 

a) identifying the nucleotide sequences of GDF 9 or GDF 9B carried by the 
female mammal it is proposed to breed from; 

b) identifying the nucleotide sequences of GDF 9 or GDF 9B carried by the 
male mammal it is proposed to breed from; 
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c) selecting the female and male animals that will result in progeny having the 
following characteristics: 

i) a single copy of a mutated GDF 9 nucleotide sequence 
comprising: 

5 A) SEQ ID NO 5; or 

B) a functional variant or fragment of the molecule in A) ;or 

C) a sequence complementary to the molecule in A) or B); 
and/or - 

ii) a single copy of mutated GDF 9B nucleotide sequence 
10 comprising: 

A) SEQ ID NOs 11 or 17; or 

■ 

B) a sequence complementary to the molecule(s) in A). 

According to a twenty-eighth aspect of the present invention there is provided the 
method above wherein the mammals selected for breeding will result in progeny 
15 having the following characteristics: 

i) a single copy of a mutated GDF 9 nucleotide sequence comprising: 

A) SEQ ID NO 5; or 

B) a functional variant or fragment of the molecule in A) ;or 

C) a sequence complementary to the molecule in A) or B); 

20 ii) a single copy of a mutated GDF 9B nucleotide sequence comprising: 

A) SEQ ID NOs 1 1 or 17; or 
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B) a sequence complementary to the molecule(s) in A). 

According to a twenty-ninth aspect of the present invention there is provided a method 
for. selecting a female mammal for breeding on the basis of possessing an increased 
rate of ovulation comprising the steps of identifying a female mammal possessing only 
5 a single mutated copy of: 

1) a mutated GDF 9 nucleotide sequence comprising: 

a) SEQ ID NO 5; or 

b) a functional variant of the molecule of a); or 

c) a sequence complementary to the molecules in a) or b); 
10 and/or 

2) a mutated GDF 9B nucleotide sequence comprising: 

a) SEQ ID NOs 11 or 17; or 

b) a sequence complementary to the molecules in a). 

According to a thirtieth aspect of the present invention there is provided the method 
15 above wherein the mammal selected has both a single mutated copy of GDF 9 and 
GDF9B. 

According to thirty-first aspect of the present invention there is provided a composition 
comprising: 

i) a mutated GDF 9 polypeptide comprising an amino acid sequence 
20 selected from the group consisting of: 

A) SEQ ID NOs. 2, 4 or 6; or 

B) a functional fragment or variant of the sequences in A); and/or 
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ii) a mutated GDF 9B polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NOs. 8, 10,12,14,16 or 18 

together with a .pharmaceutical^ or veterinarily acceptable carrier and/or 
diluent. 

5 According to a thirty-second aspect of the present invention there is provided a nucleic 
acid molecule encoding a polypeptide substantially as described above. 

According to a thirty-third aspect of the present invention there is provided a method 
of modifying the function of the corpus luteum by administering supplementary GDF9 
or GDF 9 B , or analogues thereof, or GDF9 or GDF9B antagonists to female 
10 mammals. 

According to a thirty-fourth aspect of the present invention there is provided a method 
of modulating the ovulation rate of a female mammal comprising the steps of altering 
the endogenous levels of GDF 9 and GDF 9B in said mammal. 

The present invention also encompasses ligands as polypeptides substantially as 
15 described above. 

The term "ligand" refers to any molecule which can bind to another molecule such as a 
polypeptides or peptide, and should be taken to include, but not be limited to, 
antibodies, cell surface receptors or phage display molecules. 

It should be appreciated that the term "antibody 5 ' encompasses fragments or analogues 
• 20 of antibodies which retain the ability to bind to a polypeptide of the invention, 
including but not limited to Fr, F(ab) 2 fragments, ScFv molecules and the like. The 

r 

i 

antibody may be polyclonal but is preferably monoclonal. In some embodiments the 
ligand may be a phage display molecule. 

■ 
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The term "analogues" above refers to a compound which has a biological function with 
improved characteristics over the native compounds (e g such a analogue may have a 
longer half-life than the native compound.) 

The term "antagonist" refers to a compound which inhibits the effect of another 
compound. In this context, the antagonist could refer to a purified antibody, a sera or 
serum containing an antibody or a plasma or pool of plasma containing an antibody 
that would neutralise GDF 9 or GDF 9B. 

The term "partial immunisation" refers to immunisation of an animal either active or 
passive of sufficient antigen/antibody to allow for instigation of an immune response 
to be mounted against the antigen; but the degree of antigen/antibody administered 
and/or the means of administration are such that insufficient antibodies are produced 
by the immunised animal to effectively neutralise all the antigen of interest. 

The term "full immune response" refers to the immune response of animal which has 
been fully immunised i.e. the response mounted by the immunised animal results in 
production of sufficient antibodies to effectively neutralise all the antigen of interest. 

The term "protein, or polypeptide" refers to a protein encoded by the nucleic acid 
molecule of the invention, including fragments, mutations and homologues having the 
same biological activity i.e. ovulation modulating activity. The protein or polypeptide 
of the invention can be isolated from a natural source, produced by the expression of a 
recombinant nucleic acid molecule, or chemically synthesized. 

It is to be clearly understood that the invention also encompasses peptide analogues, 
which include but are not limited to the following: 

1. Compounds in which one or more amino acids is replaced by its 
corresponding D-amino acid. The skilled person will be aware that 
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retro-inverso amino acid sequences can be synthesised by standard 
methods; see for example Choreo and Goodman, 1993; 

2. Peptidomimetic compounds, in which the peptide bond is replaced by a 
structure more resistant to metabolic degradation. See for example 

5 Olson et al, 1993; and 

3. Compounds in which individual amino acids are replaced by analogous 
structures for example, gem-diaminoalkyl groups or alkylmalonyl 
groups, with or without modified termini or alkyl, acyl or amine 
substitutions to modify their charge. 

10 The use of such alternative structures can provide significantly longer half-life in the 
body, since they are more resistant to breakdown under physiological conditions. 

Methods for combinatorial synthesis of peptide analogues and for screening of 
peptides and peptide analogues are well known in the art (see for example Gallop et al, 
1994; Hogan, 1997). 

15 For the purposes of this specification, the term "peptide and peptide analogue 11 includes 
compounds made up of units which have an amino and carboxy terminus separated in 
a 1,2, 1,3, 1,4 or larger substitution pattern. This includes the 20 naturally-occurring or 
"common" oc-amino acids, in either the L or D configuration, the biosynthetically- 
available or "uncommon" amino acids not usually found in proteins, such as 4- 

20 hydroxyproline, 5-hydroxylysine, citrulline and ornithine; synthetically-derived oc- 
amino acids, such as cc-methylalanine, norleucine, norvaline, Ca- and N-alkylated 

c 

amino acids, homocysteine, and homoserine; and many others as known in the art. 

It also includes compounds that have an amine and carboxyl functional group 
separated in a 1,3 or larger substitution pattern, such as (3-alanine, y-amino butyric 
25 acid, Freidinger lactam (Freidinger et al, 1982), the bicyclic dipeptide (BTD) 
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(Freidinger et al, 1982; Nagai and Sato, 1985), amino-methyl benzoic acid (Smythe 
and von Itzstein, 1994), and others well known in the .art. Statine-like isosteres, 
hydroxyethylene isosteres, reduced amide bond isosteres, thioamide isosteres, urea 
isosteres, carbamate isosteres, thioether isosteres, vinyl isosteres and other amide bond 
5 isosteres known to the art are also useful for the purposes of the invention. 

A "common" amino acid is a L-amino acid selected from the group consisting of 
glycine, leucine, isoleucine, valine, alanine, ■ phenylalanine, . tyrosine, tryptophan, 
aspartate, asparagine, glutamate, glutamine, cysteine, methionine, arginine, lysine, 
proline, serine, threonine and histidine. These are referred to herein by their 
10 conventional three-letter or one-letter abbreviations. 

An "uncommon" amino acid includes, but is not restricted to, one selected from the 
group consisting of D-amino acids, homo-amino acids, N-alkyl amino acids, 
dehydroamino acids, aromatic amino acids (other than phenylalanine, tyrosine and 
tryptophan), ortho-, meta- or para-aminobenzoic acid, ornithine, citrulline, norleucine, 
15 a-glutamic acid, aminobutyric acid (Abu), and oc-a disubstituted amino acids. 

The term "introducing" (or grammatical variations thereof) when used in the context of 
inserting a nucleic acid molecule into a cell, means "transfection"or "transformation" 
or "transduction" and includes reference to any method for incorporation or transfer of 
a nucleic acid molecule into a eukaryotic or prokaryotic cell for expression or 
20 replication thereof (for example this may include but should not be limited to insertion 
of a nucleic acid into: a chromosome, mitochondrial DNA, an autonomous replicon 
(eg. a plasmid). The term "transduction" as used herein, refers to the process of 
transferring genetic information from a nucleic acid molecule from one cell to another 
by way of a viral vector. 

25 The term "transfection" as used herein, refers to the uptake, incorporation, and 
expression of recombinant DNA by eukaryotic cells. 
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The term "transformation" as used herein refers to a process by which the genetic 
material carried by an individual cell is altered by incorporation of exogenous DNA 
into its genome. 

The term "variant" as used herein refers to nucleotide and polypeptide sequences 
wherein the nucleotide or amino acid sequence exhibits substantially 60% or greater 
homology with the nucleotide or amino acid sequence of the Figures, preferably 75% 
homology and most preferably 90-95% homology to the sequences of the present 
invention. - as assessed by GAP or BESTFTT (nucleotides and peptides), or BLASTP 
(peptides) or BLAST X (nucleotides). The variant may result from modification of the 
native nucleotide or amino acid sequence by such modifications as insertion, 
substitution or deletion of one or more nucleotides or amino acids or it may be a 
naturally-occurring variant. The term "variant" also includes homologous sequences 
which hybridise to the sequences of the invention under standard hybridisation 
conditions defined as 2 x SSC at 65°C, or preferably under stringent hybridisation 
conditions defined as 6 x SCC at 55°C, provided that the variant is capable modulating 
the ovulation rate of a female mammal or altering ovarian function. Where such a 
variant is desired, the nucleotide sequence of the native DNA is altered appropriately. 
This alteration can be effected by synthesis of the DNA or by modification of the 
native DNA, for example, by site-specific or cassette mutagenesis. Preferably, where 
portions of cDNA or genomic DNA require sequence modifications, site-specific 
primer directed mutagenesis is employed, using techniques standard in the art. 

A "fragment" of a nucleic acid is a portion of the nucleic acid that is less than full 
length, and comprises at least a minimum sequence capable of hybridizing specifically 
with a nucleic acid molecule according to the invention, or a sequence complementary 
thereto, under stringent conditions as defined below. A "fragment" of a polypeptide is 
a portion of the polypeptide which is less than full length, but which still retains the 
biological function of either; increasing or decreasing the ovulation rate of a mammal, 
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causing sterility in a mammal; or altering the regulation of the corpus luteum. Hence, 
a fragment according to the invention has at least one of the biological activities of the 
nucleic acid or polypeptide of the invention. However, it will be appreciated that the 
biological activity of a fragment of the GDF 9 sequence of the present invention 
encompass only those mutations which will increase the ovulation rate in female 
mammals heterozygous for the mutation. 

The term "isolated" means substantially separated or purified away from 
contaminating sequences in the cell or organism in which the nucleic acid naturally 
occurs and includes nucleic acids purified by standard purification techniques as well 
as nucleic acids prepared by recombinant technology, including PCR technology, and 
those chemically synthesised. 

The term "modulation of ovulation" means increasing or decreasing the rate of 
ovulation compared to the endogenous rate observed in an untreated animal. 

The present invention also includes primers specific for the GDF 9 and GDF 9B 
nucleotide sequences of the present invention. 

The term "hybridization" or grammatical variants thereof means the process of joining 
two complementary strands of DNA or one each of DNA and RNA to form a double 
stranded molecule. 

"Probes" are single-stranded nucleic acid molecules with a known nucleotide sequence 
which is labelled in some way (for example, radioactively, fluorescently or 
immunologically), which are used to find and mark a target DNA or RNA sequence by 
hybridizing to it. In the present invention the probe will generally be hybridized to the 
target DNA or RNA sequence under stringent conditions so the probe is specific for 
the GDF 9 or GDF 9B nucleotide sequences of the present invention. 
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^^^k "Primers" are short nucleic acids, preferably DNA oligonucleotides 15 nucleotides or 

more in length, which are annealed to a complementary target DNA strand by nucleic 
acid hybridization to form a hybrid between the primer and the target DNA strand, then 
extended along the target DNA strand by a polymerase, preferably a DNA polymerase. 
5 Primer pairs can be used for amplification of a nucleic acid sequence, e.g. by the 
polymerase chain reaction (PCR) or other nucleic acid amplification methods well 
known in the art. PCR-primer pairs can be derived from the sequence of a nucleic acid 
according to the present invention, for example, by using computer programs intended 
for that purpose such as Primer (Version 0.5® 1991, Whitehead Institute for 
10 Biomedical Research, Cambridge, MA). 

Methods for preparing and using probes and primers are described, for example, in 
Sambrook et al. Molecular Cloning: A Laboratory Manual, 2nd ed, vol. 1-3, ed 
Sambrook et al. Cold Spring Harbour Laboratory Press, Cold Spring Harbour, NY, 
1989. 

15 Probes or primers can be free in solution or covalently or noncovalently attached to a 
solid support by standard means. 

"Stringent conditions' 1 for the amplification of a target nucleic acid sequence (eg by 

* ' 

PCR) using a particular amplification primer pair, are conditions that permit the primer 
pair to hybridize only to the target nucleic acid sequence to which a primer having the 
20 corresponding wild type sequence (or its complement) would bind. 

Nucleic acid hybridization is affected by such conditions as salt concentration, 
temperature, or organic solvents, in addition to the base composition, length of the 
complementary strands, and the number of nucleotide base mismatches between the 
hybridizing nucleic acids, as will be readily appreciated by those skilled in the art. 
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When referring to a probe or primer, the term "specific for (a target sequence)" 
indicates that the probe or primer hybridizes under stringent conditions only to the 
target sequence in a given sample comprising the target sequence. 

The term "construct" as used herein refers to an artificially assembled or isolated 
nucleic acid molecule which includes the gene of interest. In general a construct may 
include the gene or genes of interest and appropriate regulatory sequences. It should 
, be appreciated that the inclusion of regulatory sequences in a construct is optional for 
example, such sequences may not be required in situations where the regulatory 
sequences of a host cell are to be used. The term construct includes vectors but should 
not be seen as being limited thereto. 

The term "vector" as used herein encompasses both cloning and expression vectors. 
Vectors are often recombinant molecules containing nucleic acid molecules from 
several sources. 

A "cloning vector" refers to a nucleic acid molecule originating or derived from a 
virus, a plasmid or a cell of a higher organism into which another exogenous (foreign) 
nucleic acid molecule of interest, of appropriate size can be integrated without loss of 
the vector's capacity for self-replication. Thus vectors can be used to introduce at least 
one foreign nucleic acid molecule of interest (e.g. gene of interest) into host cells, 
where the gene can be reproduced in large quantities. 

An "expression vector" refers to a cloning vector which also contains the necessary 
regulatory sequences to allow for transcription and translation of the integrated gene of 
interest, so that the gene product of the gene can be expressed. 

The cloning vector may be selected according to the host or host cell to be used. 
Useful vectors will generally have the following characteristics: 
(a) the ability to self-replicate; 
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(b) the possession of a single target for any particular restriction endonuclease; and 

(c) desirably, carry genes for a readily selectable marker such as antibiotic 
resistance. 

Two major types of vector possessing these characteristics are plasmids and bacterial 
viruses (bacteriophages or phages). Presently preferred vectors are bacterial, insect or 
mammalian vectors and may include the following: the pUC, pBlueScript, pGEM, 
PGEX, pBK-CMV, lambda ZAP, lambda GEM, pEFIRES-P, pUB6/V5/His, pBCl, 
pADTrack-CMV, pAdenovator, pAdEasy-1, pSFV-PD, pCA3, pBABE, pPIC9, 
pA0815, pET and pSP series. However, this list should not be seen as limiting the 
scope of the present invention. 

Examples of preferred expression systems are as follows: 

1 . For an in vitro cell expression system, the 293T cell system with a pEFIRES-P 
vector (Hobbs S et al, 1998) which confers puromycin resistance may be used. 
For coexpression of two genes, the aforementioned vector may be modified to 
change the antibiotic resistance gene to bleomycin resistance. Alternatively, 
the co-expression of two genes and the selection gene can be achieved by 
constructing a tricistronic expression vector. A corresponding stably 
transfected insect cell system can also be used, e.g. the S2 cell system using 
"DES" vector expression system; www.invitrogen.com. 

2. With respect to expressing GDF's in all tissues of transgenic animals, one 
approach is to use the pUB6/V5-His A vector ( www.invitrogen.com') to make 
the constructs. For tissue-specific expression the rat PEPCK 0.6 kb promoter 
for liver and kidney expression can be included in the construct by replacing 

(" 
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the Ubi-C promoter in the pUB6/V5-His A vector with the PBPCK promoter. 
For GDF expression in mammary tissue another promoter system would be 
preferred. For this tissue one approach would be to use the bovine (3- 

lactoglobulin gene promoter and/or the bovine a SI casein promoter (e.g. 
pBCl vector, www.invitrogen.com') to drive the expression of the GDFs into 
milk. For global over-expression in transgenic animals, the CMV enhanced 
actin promoter (Okabe M, et al.; FBBS Letters 407: 313-319, 1997) or a 
modified BF1 cc-promoter can be used also (Taboit-Dameron F, et al., 
Transgenic Research 8: 223-235, 1998). 

Adenoviruses, retroviruses and alphaviruses are other suitable mammalian expression 
systems. A typical approach to those skilled in the art is that described by (TC He et 
al, 1998),. With respect to GDF expression the pAd Track-CMV vector or 
pAdenovator vectors rwww.qbiogene.com) can be used to make the construct which is 
then co-transformed with pAd Easy-1 adenoviral plasmid into E. coli to generate a 
recombinant adenoviral genome which contains a CMV-promoter driven GDF 
expression cassette. This recombinant adenoviral genome is then transfected into 
293T cells to make the virus stock. Alternative methods for generating adenoviruses 
can also be used for the same purpose (e.g. PCA3 plasmid based gene transfer 
(www.microbix.com) : or COS-TPC method (Miyake S et al., 1996). 

3. Non-cytopathogenic Semliki Forest viruses expressing GDF's can be generated 
using, for example, pSFV-PD vectors as described by Lundstrom et al., 
Histochem Cell Biol 115: 83-91, 2001. Furthermore, retroviral expression 
systems based on, for example, pBABE vectors, can be used for expressing 

27 



GDFs in mammalian cells (Morgenstem, JP and Land, H, 1990; Nucleic Acids 
Res 18: 3587-3596). 

4. Yeast cells (e.g. Pichia pastoris, Saccharomyces cerevisiae) are another well 
established expression system to those skilled in the art (C Hadfield, et ah, 
1993);(MA Romanos et a/., 1992),. For example, the pPIC9 vector 
rwww.invitrogen.com) can be used in Pichia pastoris for the expression of 
GDF's. For coexpression of two genes, the vector pA0815 
rwww.invitrogen.com) is a preferred candidate. 

5. Echerichia coli (E. colt) is a standard laboratory expression system in 
widespread use. For example, the pET expression system (www.novagen.com) 
can be used to express recombinant mammalian GDF-9 and GDF-9B 

The DNA molecules of the invention may be expressed by placing them in operable 

\ 

linkage with suitable control sequences in a replicable expression vector. Control 
sequences may include origins of replication, a promoter, enhancer and transcriptional 
terminator sequences amongst others. The selection of the control sequence to be 
included in the expression vector is dependent on the type of host or host cell intended 
to be used for expressing the DNA as would be understood by a person skilled in the 
art. 

The term "operably linked" or grammatical variant thereof as used herein means that 
the regulatory sequences necessary for expression of the gene of interest are placed in 
the nucleic acid molecule in the appropriate positions relative to the gene to enable 
expression of the gene. 
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As used herein the term "regulatory sequences" refers to certain nucleic acid sequences 
such as origins of replication, promoters, enhancers, polyadenylation signals, 
terminators and the like, that enable expression of the nucleic acid molecule of interest. 

The term "expression" as used herein broadly refers to the process by which a nucleic 
acid molecule is converted by transcription and then translation into a protein. 

The term "gene" as used herein refers to a nucleic acid molecule comprising an 
ordered series of nucleotides that encodes a gene product (i.e. specific protein). 
The expression vectors useful in the present invention may contain at least one 
expression control sequence that is operatively linked to the DNA sequence or 
fragment to be expressed. The control sequence is inserted in the vector in order to 
control and to regulate the expression of the cloned DNA sequence. Examples of 
useful expression control sequences are the lac system, the trp system, the tac system, 
the trc system, major operator and promoter regions of phage lambda, the glycolytic 
promoters of yeast acid phosphatase, e.g. Pho5, the promoters of the yeast alpha- 
mating factors, and promoters derived from polyoma, adenovirus, retrovirus, and 
simian virus, and cytomegalovirus e.g. the early and late promoters of SV40, and other 
sequences known to control the expression of genes of prokaryotic and eukaryotic ceils 
and their viruses or combinations thereof. 

In the construction of a vector it is also an advantage to be able to identify the bacterial 
clone carrying the vector incorporating the foreign DNA. Such assays include 
measurable colour changes, antibiotic resistance and the like. In one preferred vector, 
the (3-galactosidase gene is used, which gene is detectable by clones exhibiting a blue 
phenotype on X-gal plates. This facilitates selection. Once selected, the vectors may 

► 

be isolated from the culture using standard procedures. 
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Depending on the host used, transformation and transfection is performed according to 
standard techniques appropriate to such cells. For prokaryotes or other cells that 
contain substantial cell walls, the calcium treatment process (Cohen, S N Proceedings, 
National Academy of Science, USA 69 2110 (1972)) may be employed. For 
5 mammalian cells without such cell walls, the calcium phosphate precipitation method 
of Graeme and Van Der Eb, Virology 52:546 (1978) or liposomal reagents are 
preferred. 

Upon transformation of the selected host with an appropriate vector the polypeptide 
encoded can be produced, often in the form of a fusion protein, by culturing the host 
10 cells. The polypeptide of the invention may be detected by rapid assays as indicated 
above. The polypeptide is then recovered and purified as necessary. Recovery and 
purification can be achieved using any procedures known in the art, for example by 
absorption onto and elution from an anion exchange resin. This method of producing a 
polypeptide of the invention constitutes a further aspect of the present invention. 

15 The preparation of pharmaceutical compositions including pharmaceutical carriers are 
well known in the art, and are set out in textbooks such as Remington's Pharmaceutical 
Sciences, 19th Edition, Mack Publishing Company, Easton, Pennsylvania, USA. 

The compounds and compositions of the invention may be administered by any 
suitable route, and the person skilled in the art will readily be able to determine the 
20 most suitable route and dose for the condition to be treated. Dosage will be at the 
discretion of the attendant physician or veterinarian, and will depend on the nature and 
state of the condition to be treated, the age and general state of health of the subject to 
be treated, the route of administration, and any previous treatment which may have 
been administered. 
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The carrier or diluent, and other excipients, will depend on the route of administration, 
and again the person skilled in the art will readily be able to determine the most 
suitable formulation for each particular case. 

The invention also includes adenovirus-based gene therapy techniques for expressing 
GDF-9B and GDF-9/GDF-9B in cell cultures, organ cultures and whole experimental 
animals for manipulkting ovarian follicular protein synthesis or production. 

Brief Description o f drawings 

Further aspects of the present invention will become apparent from the following 
description which is given by way of example only and with reference to the 
accompanying drawings in which: 

Figure 1 Shows Irish Cambridge and F700 Belclare sheep GDF9 sequence and 

mutations; 

Figure 2 Shows Irish Cambridge and F700 Belclare sheep GDF9B sequence and 

mutations; 

Figure 3 Shows schematic representation of genotypes within a F700 Belclare 

and Cambridge pedigrees; 

Figure 4 Shows nucleotide and amino acid of wildtype sheep GDF9 showing 

positions of mutations in Irish Cambridge and F700 Belclare sheep; 

Figure 5 Shows nucleotide and amino acid of sheep GDF9B showing positions of 

mutations in Irish Cambridge and F700 Belclare sheep; 

Figure 6 Shows alignment of GDF9 and GDF9B protein sequence with other 

members of the TGFp superfamily members for which structures have 
been determined; 
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Figure 7 Shows examples of the pattern of progesterone concentrations in plasma 

of actively immunized ewes; and 

Figure 8 Shows the average concentrations of progesterone in plasma following 

synchronization of luteal regression. 

Best Modes for Carrying out the Invention 

Non-limiting examples illustrating the invention will now be provided. It will be 
appreciated that the above description is provided by way of example only and 
variations in both the materials and techniques used which are known to those persons 
skilled in the art are contemplated. 

METHODOLOGY 

Animals 

The flocks of Cambridge and'Belclare sheep at The Sheep Research Centre of Teagasc, 
Athenry, are routinely examined for ovulation rate at the beginning of each mating 
season using laparoscopy. The examination is done once before joining and once after 
the first mating of the joining period. These data have been collected each year since 
these flocks were established. The flocks are self-contained with at least 5 males used 
for mating each year. In addition rams from these flocks have been progeny tested for 
ovulation rate by crossing with Galway and Scottish Blackface ewes - both low 
prolificacy breeds. Ovulation rate measurements are done by laparoscopy under licence 
from The Minister of Health under the Cruelty to Animals Act (1876) EU Directive 
86/609/EC. 

When sterile ewes were first detected they were checked for the possibility that they 
were freemartins but this could not be confirmed (Hanrahan, 1991). Blood samples 
were retained for DNA extraction from the sterile Cambridge ewes born in 1990 and 
later years and from essentially all of the F700 Belclare sterile females born since 
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1993. This material has been supplemented by blood samples for DNA extraction 
collected from fertile ewes in these flock from 1992 onwards. 

Ovulation rate data were analysed by least squares procedures with the individual 
animal as the experimental unit using the GLM procedure of SAS. The factors in the 
models were ewe, age, year of record, and the number of copies (0 or 1) of each of the 
mutations described below. 

Genomic DNA was isolated from Irish Cambridge and F700 Belclare sheep either 

* 

from frozen stored buffy coat or directly from white blood cells in whole blood using 
the method of Montgomery and Sise (Montgomery and Sise, 1990). Parentage of key 
pedigrees was verified with autosomal sheep microsatellite markers OarHH64 (sheep 
chromosome 4), OarCP34 (sheep chromosome 3) and OarFCB304 (sheep chromosome 
19) (Maddox et al, 2001). 

Sequencing and Mutation Detection 

The sheep GDF9 and GDF9B genes were amplified using the polymerase chain 
reaction (PCR) with primers designed from published sheep sequences (sheep genomic 
GDF9B . exon 1, AF236078; sheep genomic GDF9B exon 2, AF236079; sheep 
genomic GDF9 exon 1 and 2, AF078545). 

The PCR primers used were as follows: 

GDF9B exon 1 B 1 3 : 5 ' - ACTGCTGCCTTGTCCCAC-3 ' 

B28: 5 ' -AGGCAATGTGAAGCCTG AC A-3 ' 

GDF9B exon 2 B25: 5 ' -C AGTTTGT ACTGAGC AGGTC-3 ' 

04: 5 ' -TTCTTGGGAAACCTGAGCTAGC-3 ' 

GDF9 exon 1 Gl: 5'-GAATTGAACCTAGCCCACCCAC-3' 
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G4: 5 ' - AGCCT AC ATC AACCC ATGAGGC-3 ' 
GDF9 exon 2 G5: 5 ' - ATCCC ACCCTGACGTTT AAGGC-3 ' 

G7: 5 ' -TCCTCCC AAAGGC AT AG AC AGG-3 ' 
The resulting PCR products were sequenced on an ABI 373 sequencer. 
Single Stranded Conformational Polymorphism Detection 

SSCP (single stranded conformational polymorphism) was carried out on 9 Belclare 
rams involved in the progeny testing programme and on the half sib progeny of three 
of these rams (n = 58 (29, 17, and 12 progeny respectively)) and also on 2 Cambridge 
rams one of whom was progeny tested. In addition, seven purebred daughters of two of 
the Belclare rams were tested along with four of the five dams involved. 

* 

GDF9B genotypes were determined by analysis of three nucleotide fragments which 
spanned most of exon 2. Fragments analysed by SSCP were: 

353 bp, exon 2 primer 9B-359 5'-CGC TTTGCT CTT GTT CCC TCT-3' 

primer 9B-691 5'-CCT CAC TAC CTC TTG GCT GCT-3' 
273 bp, exon 2 primer 9B-664 5'-GGG TTC TAC GAC TCC GCT TC-3' 

primer 9B-916 5'-GGT TAC TTT CAG GCC CAT CAT-3' 
312 bp, exon 2 primer 9B-915 5'-CAT GAT GGG CCT GAA AGT AAC-3' 

primer 9B-1205 5'-GGC AAT CAT ACC CTC ATA CTC C-3' 

Primers were designed from nucleotide sequence Genbank Accession number 
AF236079 and primer names correspond to nucleotide position within that sequence. 
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GDF9 genotypes were determined by analysis of five fragments which spanned exon 1 
part of the intron and most of exon 2. Fragments analysed by SSCP were: 

462 bp, exon 1 primer G9-1734 5'-GAA GAC TGG TAT GGG GAA ATG-3 ' 



5 294 bp, intron 



primer G9-2175 5'-CCA ATC TGC TCC TAC ACA CCT-3' 

primer G9-2676 5'-GTG TGA GAG AGA TGG GAG CA-3' 

primer G9-2947 5'-AAG AGG AAA ACT ATC AAA AGA 
CA-3' 



296 bp, exon 2 primer G9-3270: 5'-TGG CAT TAC TGT TGG ATT GTT TT- 

3' 

10 primer G9-3546: 5'-CAA GAG GAG CCGTCA CAT CA-3' 

206 bp, exon 2 primer G9-3543: 5'-GAT TGA TGT GAC GGC TCC TCT-3' 

primer G9-3728: 5'-GGG AAT GCC ACC TGT GAA AAG-3' 
221 bp, exon 2 primer G9-3939: 5'-TCT TTT TCC CCA GAA TGA ATG T-3' 

primer G9-4140: 5'-CAC AGG ATG GTC TTG GCA CT-3' 

15 Primers were designed from nucleotide sequence Genbank Accession number 
AF078545 and primer names correspond to nucleotide position within that sequence. 

Amplification was carried out for 30 cycles in a 40 uL reaction mixture, using 150 ng 
of genomic DNA, with 1.5 mM or 3 mM magnesium and an annealing temperature of 
55 to 58° C. PCR fragments were analysed by SSCP in polyacrylamide gels with 
20 overnight migration at 9-15 V/cm, 15°C. 
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Single Nucleotide Polymorphism Detection Assays 

The [El] polymorphism identified in GDF9 exon 1 produced a G to A nucleotide 
change which disrupts a Hha I restriction enzyme cleavage site (GCGC to GCAC) at 
nucleotide 260 of the 462 bp PCR fragment produced by primers G9-1734 and G9- 

5 2175 above. Digestion was carried out using 9 ul of PCR product and 3 U Hha I in 15 
Hi final volume, for 6 h at 37° C. Restriction digestion of the PCR product from 
wildtype animals with Hha I resulted in cleavage of the 462 bp product (at two internal 
Hha I sites) into fragments of 52 bp, 156 bp and 254 bp. However, DNA fragments 
containing the A nucleotide are not cleaved at this site and fragment sizes of 52 bp and 

10 410 bp are seen. Animals heterozygous for the mutation have fragments of all four 
sizes (52 bp, 156 bp, 254 bp and 410 bp). 

The remaining single nucleotide polymorphisms (SNPs) in GDF9 and GDF9B 
identified by sequencing did not affect common restriction endonuclease cleavage 
sites. In order to screen these polymorphisms through the F700 Belclare and 

15 Cambridge flocks of sheep, PCR was carried out using primers with single mismatches 
in order to deliberately generate products that contained restriction enzyme sites. 
Assays have been designed so that digestion with the appropriate restriction enzyme 
cleaves either PCR products from wild-type animals or PCR products from animals 
containing the SNP, as specified below. The resulting band shift can be resolved on a 

20 high percentage agarose gel. The primer sequences and PCR conditions for each assay 
are as follows. The mismatch created in the appropriate primer to generate the 
restriction enzyme cleavage site is underlined. 

In all five assays below, amplification was carried out at: 94°C for 5 min; 35 cycles of 
94°C for 30 sec, an annealing step for 40 sec (at the specific temperature stated below 
25 for each assay) and 72°C for 30 sec; followed by a final extension of 72°C for 4 min. 
Magnesium concentration was 1.5 mM. 
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The primers used for the GDF9 [324] nucleotide change amplify a 161 bp PCR 
product. 

[324]-SfulF 5 ' -GGAAT ATTC AC ATGTCTGT AAATTTTAC ATGTTCG-3 ' 
[324]-Sfu3R 5'-GAGGGAATGCCACCTGTGAAAAGCC-3' 
5 Annealing temperature 63 °C 

Non-wildtype strand cleaved by restriction enzyme Sfu I 

The primers used for the GDF9 [714] nucleotide change amplify a 158 bp PCR 
product. 

[7 1 4]-Tru 1 R 5 ' -C AGTATCGAGGGTTGTATTTGTGTGGGGCCT-3 ' 
10 [714]-Tru3F 5' -GCCTCTGGTTCCAGCTTC AGTC-3 ' 
Annealing temperature 63 °C 

Non-wildtype strand cleaved by restriction enzyme Mse I 

The primers used for the GDF9 [787] nucleotide change amplify a 139 bp PCR 
product. 

15 [787]-DdelR: 5'-CATGGATGATGTTCTGCACCATGGTGTGAACCTGA-3' 
[787] -Dde3F: 5 ' -CTTT AGTC AGCTGAAGTGGGAC AAC-3 ' 
Annealing temperature 62°C 

Wildtype strand cleaved by restriction enzyme Dde I 

The primers used for the GDF9B [SI] nucleotide change amplify a 141 bp PCR 
20 product. 



« 
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[Sl]-HinflF: 5'-CACTGTCTTCTTGTTACTGTATTTCAATGAGAC-3' 
B26: 5 ' -GATGC AATACTGCCTGCTTG-3 ' 

« 

Annealing temperature 63°C 

Wildtype strand cleaved by restriction enzyme Hinfl 

5 The primers used for the GDF9B [S2] nucleotide change amplify a 153 bp PCR 
product. 



[S2] 



-DdelF: 5'-GCCTTCCTGTGTCCCTTATAAGTATGTTCCCCTTA-3' 



04: 5'-TTCTTGGGAAACCTGAGCTAGC-3' 
Annealing temperature 64°C 

« 

10 Wildtype strand cleaved by restriction enzyme Dde I 

Restriction digestion of PCR with Hinfl [SI] or Dde I [[787] and [S2]] resulted in a 
cleavage of the longer primer from the fragment amplified from wild-type alleles (thus 
producing a 30-35 bp smaller product than the uncleaved fragment from animals 
containing alleles with the mutation). Restriction digestion of PCR with Sfu I [324] or 

15 Mse I [714] resulted in a cleavage of the longer primer from the fragment amplified 
from mutant alleles (thus producing a 30-35 bp smaller product than the uncleaved 
fragment from animals containing wild-type alleles). Animals heterozygous for any of 
the mutations have fragments of both sizes. The digested fragments were separated on 
a 4% agarose gel and visualised with ethidium bromide staining. The gels were scored 

20 for the presence or absence of the mutations. Homozygous, heterozygous and negative 
controls were included with each assay. 
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Immunisation experiments 

m • 

* 4 

All experiments were performed with the approval of the Animals Ethics Committee at 
Wallaceville Animal Research Centre in accordance with the 1987 Animal Protection 
(Codes of Ethical Conduct) Regulations of New Zealand. The animals used in the 
5 immunization studies (n=54) were 5 to 6 year old parous Romney ewes. 

Generation of antigens for immunization of sheep 

Peptides KKPLVPASVNLSEYFC (GDF9) and SEVPGPSREHDGPESC (GDF9B) 
were synthesized and conjugated to KLH through the C terminal cysteine residue by 
Macromolecular Resources (Colorado State University, Fort Collins, CO). 

10 Active immuniiation of ewes against GDF9 and GDF9B peptides 

Ewes were injected (i.m) with 0.4 mg KLH (control, n=10), 0.4 mg KLH-GDF9 
peptide conjugate (GDF9 peptide; n=10) or 0.4 mg KLH-GDF9B peptide conjugate 
(GDF9B peptide; n=10) in 1 ml of Freund's complete adjuvant for the initial 
immunization. Thereafter, ewes were immunized once monthly with 0.2 mg KLH, 
15 GDF9 peptide or GDF9B peptide in 1 ml of saline mixed with 1.25 ml STM (Span- 
Tween-Marcol) for 6 months. After the 5 th injection, vasectomised rams with marking 
harnesses were run with the ewes to monitor estrous cycles. The length of the estrous 
cycle was calculated as the days between first observed markings by the vasectomised 
ram of successive cycles. In addition, blood samples were collected via the jugular 
20 vein 3 times a week for determination of plasma progesterone concentrations. 
Ovulation rates of the ewes that displayed estrous behaviour were determined by 
laparoscopy once all of the control ewes had been observed in estrus and for each 
successive estrous cycle. In addition, ovulation rate of all ewes was determined by 
laparoscopy 3-4 weeks prior to ovarian collection. Approximately 2 weeks following 
25 the final injection, ewes were killed using a captive bolt and exsanguinated. The blood 
collected from all ewes was to be used in subsequent passive immunisation studies. 
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Both ovaries were recovered and the number of corpora lutea present was recorded and 
one ovary from each ewe was fixed in Bouins fluid for morphological examination and 
analysis of follicular populations. 

i 

Passive immunisation of ewes against KLH, KLH-GDF9 peptide and KLH-GDF9B 
peptide 

Pools of antiplasma from KLH (n=9), GDF9 peptide (n=7, all anovulatory ewes) and 
GDF9B peptide (n=9, all anovulatory ewes) treated ewes were generated by combining 
the plasmas obtained from some of the actively immunized ewes within each treatment 
group. The estrous cycles of ewes were synchronized by using a prostaglandin 
) derivative (Estrumate; 125ug). Estrus was detected with the aid of a vasectomised 
ram wearing a marking harness. On day 4 or 5 of the estrus cycle (estrus = day 0) ewes 
were laparascoped to determine ovulation rate and fitted with an indwelling jugular 
cannula. The following day ewes (n=4-5 per group) were administered 100 ml of 
antiplasma to KLH, GDF9 peptide or GDF9B peptide through the indwelling jugular 
5 cannula. Ewes were given another injection of Estrumate, at .96h after administration 
of the antiplasma to induce a follicular phase and ovulation rate was determine by 
laparoscopy at 10 days after the injection of Estrumate and every 15-18 days thereafter 
until the end of the breeding season (as assessed by lack of estrous activity in non- 
experimental sheep). Blood samples were collected from the ewes at 5 minutes, 1 h 
20 and 96 h after injection of the antiplasma and thereafter 3 times a week from the 2 
injection of Estrumate for determination of antibody titers and concentrations of 
progesterone in plasma. 

Determination of progesterone concentrations 

Concentrations of progesterone in plasma were determined by RIA as described 
25 previously (McNatty et al., 1981). The inter- and intra-assay co-efficients of variation 
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were 



<10% and assay sensitivity was 0.1 ng/ml. All samples below the sensitivity of 
the assay were assigned a value of 0. 1 ng/ml for statistical analysis. 

Short-term immunisations 

Romney ewes were immunised with either KLH, KLH conjugated to GDF9 peptide or 
5 KLH conjugated to GDF9B peptide. The antigens were administered in DEAE Dextran 
(4% w/v) on 2 occasions one month apart. Thereafter the ovaries of these animals 
were visualised following exteriorisation via a mid-line incision and the number of 
corpora lutea counted (two observations at successive cycles per ewe). 

Statistical analysis 

10 For the long-term, actively immunized ewes, ovulation rate for individual ewes was 
calculated as the mean of the number of corpora lutea observed at all observations for 
that ewe when at least 1 corpus luteum (CL) was present (i.e. observations of no CL 
were excluded from the calculation). The Kruskal-WalUs test was used to compare 
ovulation rates between the KLH-GDF9B mature protein and the KLH treated groups. 
15 No other groups were included in this comparison since none had sufficient numbers 
of ewes ovulating. The Chi Square test was used to compare the proportion of ewes 
observed in estrus by the time all the control ewes had been observed in estrus. In 
addition the Chi Square test was used to compare the proportion of ewes with corpora 
lutea on their ovaries 3-4 weeks before and at ovarian collection. 

20 When examining the effects of active immunization treatments on ovarian volumes, 
numbers of follicles or oocyte or follicular diameters, the data were analysed within 
each follicle type after normalising the data by log transformation. For each parameter 
a one-way ANOVA was performed, after blocking on animals where appropriate, and 
differences between treatment groups were determined by least significant difference. 
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For the passively immunized ewes, differences in the number of ewes with corpora 
lutea at each laparoscopy were determined using Fisher's exact test. The areas under 
the curves were calculated using Genstat using the area function for progesterone 
values from 2 to 19 days following injection of Estrumate that was given 4 days after 
administration of plasma. Resulting values were analysed with one-way ANOVA and 
differences between the control and treated ewes determined with Fisher's pairwise 



comparisons. 



For the short term active immunizations, ovulation rate for individual ewes was 
calculated as the average of the number of corpora lutea observed at both observations. 
10 Data was analysed using the general linear models procedures of SAS. Differences 
between least-squares means were evaluated by least significant differences. 

RESULTS and ANALYSIS OF RESULTS 

Finding mutations in Cambridge and F700 Belclare animals 

In order to determine whether mutations in GDF9 or GDF9B were contributing to 
15 sterility in these animals sequence information was obtained for the entire coding 
sequence of both genes in a subset of Irish Cambridge (N = 9) and F700 Belclare sheep 
(N = 10). Animals were chosen for full-length sequencing based on their sterility 
phenotype or their pedigree relationship to sterile animals. In addition, mutation 
detection was also carried out by single-stranded conformational polymorphism 
20 (SSCP) analysis independently of the above sequencing in F700 Belclare pedigrees 
(23 animals and 58 progeny test daughters of three rams) and also on 2 Cambridge 



rams. 



Mutations in GDF9 



Sequence of GDF9 revealed eight single nucleotide polymorphisms across the entire 
25 coding region (Table 1, Figure 4). SSCP analysis identified five fragments across the 
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gene which contained conformational differences. These differences correspond to one 
single nucleotide polymorphism (SNP) in exon 1, one SNP in the intron and five SNPs 
in exon 2. 

Original naming of the mutations (numbers in square brackets [ ]. Table 1) refers to the 
5 nucleotide position from the start of exon 2, except for [El] which refers to the 
polymorphism found in exon 1 of GDF9. Table 1 shows the relationship between (a) 
the original numbering system, (b) the coding nucleotide position in the full length 
coding sequence numbered from the first atg, (c) the position of the coding amino acid 
residue involved, and (d) the position of the coding residue (if any) within the mature 
10 coding sequence, numbering from the first amino acid after the furin protease 
processing site. For example mutation [581] is a G to A nucleotide substitution at 
coding nucleotide 978 of GDF9 which corresponds to an unchanged glutamate (Glu) 
coding residue 326 of the full length unprocessed protein, or residue 8 of the processed 
mature peptide. 

15 Three of the eight polymorphisms are nucleotide changes which do not result in an 
altered amino acid ( [74] at nucleotide position 471, [80] at nucleotide 477, and [581] 
at nucleotide position 978). The five remaining nucleotide changes [324], [597], [714], 
and [787] give rise to amino acid changes (Table 1), Figure 1, Figure 4), although three 
of them are relatively conservative changes. The [El] arginine to histidine change at 
20 amino acid residue 87 in exon 1 substitutes one basic charged polar group with 
another, and occurs at a position prior to the furin processing site, so is unlikely to 
affect the activity of the mature protein. Both the [597] valine to isoleucine change at 
amino acid residue 332 of the unprocessed protein (residue 14 of the mature coding 
region) and the [714] valine to methionine at residue 371 of the unprocessed protein 
25 (residue 53 of the mature coding region) substitute non-polar groups with non-polar 
groups. The remaining two changes result in non-conservative substitutions. The [324] 
glutamic acid to lysine change at amino acid residue 241 of the unprocessed protein 



43 



replaces an acidic group with a basic group, but this occurs at a position prior to the 
furin processing site and is unlikely to affect the mature active coding region. However 
the [787] serine to phenylalanine change at residue 395 replaces an uncharged polar 
group with a non-polar group at residue 77 of the mature coding region. The nucleotide 
5 and amino acid changes are illustrated in Figure 1 and Figure 4. 

Mutations in GDF9B 

Both DNA sequencing and independent SSCP analysis of GDF9B in Cambridge and 
F700 Belclare sheep revealed four polymorphisms across the entire coding region 
(Table 1, Figure 5). Original naming of these mutations (in square brackets [ ]) refers 

10 specifically to the leucine deletion [Leu], or for the conservative [422] T to C 
mutation, the nucleotide position from the start of exon 2. GDF9B mutations which 
changed amino acids were named [SI] and [S2]. Table 1 shows the relationship 
between (a) the original numbering system, (b) the coding nucleotide position in the 
full length coding sequence numbered from the first atg, (c) the position of the coding 

15 amino acid residue involved, and (d) the position of the coding residue (if any) within 
the mature coding sequence, numbering from the first amino acid after the furin 

* 

protease processing site. For example mutation [S2] is a G to T nucleotide substitution 
at coding nucleotide 1100 of GDF9B which corresponds to an serine (Ser) residue 
changing to an isoleucine (lie) residue at coding residue 367 of the full length 
20 unprocessed protein, or residue 99 of the processed mature peptide. 

The first of these four polymorphisms (Table 1) is a previously-reported leucine 
deletion polymorphism [leu] in the predicted signal sequence (Galloway et al, 2000) 
whereby some sheep have two leucine codons (CTT) at this position and some sheep 
have only one. This polymorphism has been shown to be unrelated to fertility and 
25 ovulation rate in Inverdale sheep (Galloway et al, 2000). One other nucleotide change, 
[422], does not result in an altered amino acid (nucleotide position 747). The 
remaining two nucleotide changes ( [SI] and [S2] )give rise to more critical changes in 
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the protein (Figure 2, Figure 5). The [SI] C to T change at nucleotide 718 introduces a 
premature stop codon (TAG) in the place of glutamic acid (Q, CAG) at amino acid 
residue 239 of the unprocessed protein, which presumably results in complete loss of 
GDF9B function. The [S2] G to T change at nucleotide 1100 changes the serine 
5 residue at amino acid 99 of the mature active protein (residue 367 of the unprocessed 
protein) to an isoleucine, thereby substituting an uncharged polar group with a 
nonpolar group. The nucleotide and amino acid changes are illustrated in Figure 2 and 
Figure 5. 

Screening for mutations in more animals 

10 Initial sequencing of a smaller number of animals from each family identified the [74], 
[80], [324], [714], and [787] nucleotide changes in GDF9, and the [SI], [S2] and [422] 
changes in GDF9B. Forced RFLP (restriction fragment length polymorphism) assays 
to detect the specific SNPs were developed for [324], [714], [787] (GDF9) and for 
[SI] and [S2] (GDF9B), and these assays were carried out on larger numbers of 

15 animals (Table 2). Subsequent sequencing of full length GDF9 and GDF9B in more 
animals revealed the [581] and [597] nucleotide changes in GDF9 in the Cambridge 
sheep but not the F700 Belclares. Independent SSCP analysis identified the [El] 
polymorphism in exon 1 of GDF9 in one ram, and this was also screened through 
further animals. [El] was found to be associated with the wildtype alleles in this ram 

20 and his backcross progeny, and not associated with ovulation rate . 

Homozygous mutations relate to sterility 

Presence or absence of each of these nucleotide changes was analysed in relation to 
sterility or fertility in all of the animals tested, revealing that only the [787] change in 
GDF9 and the [SI] and [S2] changes in GDF9B contributed to infertility. Female 
25 sheep which are homozygous for [787] are sterile; female sheep which are 
homozygous for [SI], or homozygous for [S2] are sterile; female sheep which are 
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heterozygous for [SI] and [S2] simultaneously (whereby each each chromosome 
carries a different GDF9B mutation) are sterile. Figures 3 a and 3b show small 
pedigrees illustrating what is seen in the larger set of animals. 

Figure 3a illustrates a F700 Belclare pedigree. The sire R830 carries the GDF9B [S2] 
mutation on his X chromosome and the GDF9 [787] mutation on chromosome 5, but 
does not have the GDF9B [SI] mutation. Dam 9704 carries a single copy of the 
GDF9B [SI] mutation on her X chromosome and their two female offspring (930458 
and 930459) are sterile since they have inherited inactive copies of GDF9B from both 
parents. Dam 8783 carries a single copy of the GDF9 [787] mutation on chromosome 5 
and the female offspring of her mating with sire R830 are infertile and are homozygous 
for the GDF9 [787] mutation. Their infertility cannot be explained by GDF9B 
mutations. Offspring 930810 and 948302 are not homozygous for any of these 
mutations and hence are fertile. All three functional mutations ( [SI], [S2], and [787] ) 
were seen in the F700 Belclare flock (Table 2). 

« 

Figure 3b illustrates two Cambridge pedigrees. The sire 962101 carries the GDF9B 
[SI] mutation on his X chromosome and the GDF9 [787] mutation on chromosome 5, 
but does not have the GDF9B [S2] mutation. Dam 962152 carries a single copy of the 
[SI] mutation on her X chromosome and a single copy of the [787] mutation on 
chromosome 5. Their two female offspring (997634 and 997635) are sterile and have 
inherited inactive copies of both GDF9B ( [SI] ) and GDF9 ( [787] ) from both 
parents. Dam 976234 only carries a single copy of the [SI] mutation and one female 
offspring (997553) is infertile, having inherited inactive copies of GDF9B ([SI]) form 
both parents, whereas 997552 is fertile. Sire 930142 is homozygous for the GDf9 
[787] mutation and carries the GDF9B [SI] mutation on his X chromosome, whereas 
dam 8874 is only heterozygous for the GDF9 [787] mutation and carries no GDF9B 
mutation. Their daughter (948093) has inherited two copies of the GDF9 [787] 
mutation and is sterile even though she also heterozygous for the GDF9B [SI] 
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mutation which she inherited from her sire. The [S2] mutation was not seen in any 
animals tested from the Cambridge flock (Table 2). 

Among the animals tested for these changes we found fertile animals homozygous for. 
GDF9 [324] and [714] and conclude that neither of those changes result in disruption 
of the genes sufficient to cause sterility. We also found animals which were 
heterozygous for GDF9 and GDF9B mutations together, and these animals were not 
sterile. 

* 

Structural effects of mutations on activity 

Sufficient structural data has been obtained for members of the TGF(3 superfamily to 
provide information about the likely effects of each of the three mutations ( [SI], [S2] 
and [787] ) on the biological activity of GDF9 and GDF9B, and hence explain the 
association with sterility. Structures have been reported for TGF pi (Hinck et al, 
1996), TGF-P2 (Daopin et al., 1992; Schlunegger and Grutter, 1992; Schlunegger and 
Grutter, 1993), TGF-p3 (Mittl et al, 1996), BMP7/OP1 (Griffith et al, 1996) and 
BMP2 (Scheufler et al, 1999). Receptor binding structures have also been reported for 
BMP2 with the BRIA receptor binding ectodomain (Kirsch et al., 2000a) and for TGF- 
p3 with ecTpR2 receptor binding ectodomain (Hart et al, 2002). 

The [SI] mutation results in premature termination of GDF9B protein prior to the 
mature active protein processing site. This mutation would result in no mature protein 
being translated, and is an even more severe effect than the Hanna mutation (Galloway 
et al., 2000) which results in infertility in sheep. The GDF9B [S2] mutation changes an 
uncharged polar serine residue (residue 99 of mature GDF9B) which is conserved 
across most members of the TGFp superfamily, to a non-polar isoleucine (Figure 6). 
This serine (and the nearby conserved leucine) has been shown to be essential for 
receptor binding by structural and site-directed mutagenesis studies of BMP2 (Kirsch 
et al, 2000b). In F700 Belclare sheep it appears that this mutation abolishes biological 
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activity of GDF9B, presumably by affecting . receptor binding. The GDF9 [787] 
mutation changes an uncharged polar serine residue (residue 77 of mature GDF9) to a 
non-polar phenylalanine in a region of the molecule which is involved in dimerisation. 
This change occurs only three residues away from a conserved histidine (H80) of the 
mature GDF9 peptide (Figure 6). In BMP7 this conserved histidine exhibits hydrogen 
bonding to three residues of the paired molecule in the BMP7 dimer (Griffith et al., 
1996) and TGFp3 (Mittl et al., 1996). GDF9 lacks the interchain disulphide bond 
which forms a covalent link between both monomers of the biologically active dimer 
that is found in most other members of the TGFp superfamily. This makes it likely that 
in GDF9 the hydrogen bonds between monomers are even more critical for 
maintaining dimer stability, and suggests that the GDF9 [787] mutation could be 
abolishing biological activity by disrupting dimerisation. 

Heterozygous animals have increased ovulation rate 

Irish Cambridge and F700 Belclare sheep have increased ovulation rates as well as 
infertility (Hanrahan, 1996). We examined ovulation rate data that was available for 
the fertile ewes which had been genotyped for the [SI], [S2] and [787] mutations 
(Table 3). Heterozygous carriers of mutations in GDF9B (either [S 1] or [S2] ) show 
increased ovulation rates similar to those seen in Inverdale and Hanna sheep (Davis et 
al, 2001). Interestingly we see here for the first time that sheep heterozygous for the 
GDF9 [787] mutation also show increased ovulation rates and this increase is larger 
than for heterozygous carriers of GDF9B mutations (Table 3). It also appears that 
animals which are heterozygous for both a GDF9B mutation and a GDF9 mutation 
have an even higher ovulation rate, and this effect is probably additive. The effects of 
GDF9 [787] and GDF9B [SI] are additive in the Cambridge where the number of ewes 
in each genotype class was reasonably balanced and the effect of one copy of GDF9 
[787] was twice as large as one copy of GDF9B [SI]. In the F700 Belclare the effect 
of one copy of GDF9B [S2] is close to the effect for [SI] in the Cambridge. The 
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evidence indicates that the effect of GDF9 [787] in the F700 Belclare is if similar 
magnitude as in the Cambridge and that the combination of GDF9B [S2] and GDF9 
[787] give an ovulation rate that is similar to. that from GDF9B [SI] and GDF9 [787] 

4 

in Cambridge. Progeny test data for a set of nine F700 Belclare rams with genotype 
information for GDF9 and GDF9B mutations provides evidence that the effect of [S 1] 
is 0.6 ; [S2] is 1.1 and GDF9 [787] is 1.8 . This suggests that the effect of [S2] on 
ovulation rate is greater than the effect of [S 1], 

Effect of immunising sheep against GDF9 

In addition to the genotype effects above we have also shown that both long-term 
active immunisation and short-term passive immunisation of sheep with GDF9 causes 
sterility and/or abnormal corpus luteum function, and this finding provides additional 
evidence that a homozygous GDF9 mutant phenotype is one of sterility. 

Repeated laparoscopy of ewes which had been in estrus and which were then were 
actively immunised against Kin (control, n = 9) or KLH conjugated to GDF9 peptide 
(n = 10) showed that no ewe actively immunised with GDF9 peptide showed cyclic 
estrous behaviour (Table 4). High (normal) progesterone concentrations was only seen 

i 

in one or two samples (Figure 7), and most times when corpora lutea and/or luteal-like 
structures are observed following long-term immunisation against KLH conjugated to 
GDF9 peptide, progesterone concentrations are abnormal. In addition many of the 
ewes did not have any visible antral follicles at laparoscopy or at ovarian collection. 
Figure 7 also shows data for ewes immunised against GDF9B/BMP 1 5 which we have 
shown previously (refer WO 01/96393), and which is included here for comparison 
with GDF9. 

Passive immunisation using 100 ml of GDF9 peptide antiplasma can cause abnormal 
luteal function within 30 days of administration of the antiplasma (Figure 8). There 
were no differences in ovulation rates among the groups before administration of the 
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antiplasma. Administration of antipjasma against GDF9 peptide 4 days before 
induction of the follicular phase did not affect ovulation rate. However, at laparoscopy 
the corpora lutea of two of the animals treated with GDF9 antiplasma appeared smaller 
than normal. In addition, the overall mean concentration of progesterone during the 

♦ 

5 subsequent luteal phase was less (P<0.05) than that observed in the control animals 
(Figure 8). This was the result of the progesterone concentrations being normal in two 
of the animals but in the other three animals, the post ovulation rise in progesterone 
was delayed even though luteolysis occurred at the normal time. Figure 8 also shows 
data for passive immunisation with GDF9B/BMP15 which we have shown previously 

10 (refer WO 01/96393), and which is included here for comparison with GDF9. 

In another new experiment we show that short-term active immunisation of sheep with 
GDF9 or GDF9B can mimic the heterozygous effects of mutations in these genes. 
Short-term immunisation using milder adjuvant than in the previous experiments (2 
immunisations in DEAE-Dextran adjuvant), with either KLH conjugated to GDF9 
15 peptide or with KLH conjugated to GDF9B peptide, acts to increase ovulation rate in 
the animals which ovulated (Table 5). Moreover, more corpora lutea were evident in 
animals treated with GDF9 peptide than those treated with GDF9B. 

CONCLUSIONS 

These findings provide the first evidence that mutations in GDF9 and GDF9B are 
20 associated with the reproductive effects seen in the Cambridge and Belclare breeds of 
sheep. The increased ovulation rate and sterility phenotypes in these animals can be 
explained by the presence of heterozygous mutations and homozygous mutations, 
respectively, in these genes. 

The two new mutations in GDF9B described here support the evidence from previous 
25 descriptions of mutations in this gene in sheep (Galloway et al. 9 2000). Inactivating 
mutations in GDF9B cause increased ovulation rate and infertility in a dosage 
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dependent manner. The serine to isoleucine change in carriers of the [S2] mutation 
supports the notion that small perturbations of protein structure within the GDF9B 
mature peptide have serious consequences in protein activity. 

The discovery of an inactivating mutation in GDF9 associated with infertility and 
5 increased ovulation rate in sheep is the first evidence that GDF9 is also important for 
increasing ovulation rate. Although a knockout mutation of GDF9 in mice has been 
shown to cause infertility, no effects for GDF9 on increasing ovulation rate have been 
described. Our discovery shows that small pertubations of protein structure within the 
GDF9 mature peptide also have severe consequences on protein activity. 

10 These new mutations in GDF9B and GDF9 together provide strong support for the 
likelihood that other amino acid changes in the receptor-binding and dimerisation 
domains, or regions of the protein that disrupt protein folding of the mature peptide are 
likely to have similar effects. In addition, the effects of a GDF9 mutation and a GDF9B 
mutation together in one animal appear to be additive, implying that GDF9 and 

15 GDF9B are working independently, and that combinations of both proteins can be used 
to alter ovarian function more effectively than by altering either GDF9 or GDF9B 
alone. 

Our finding of abnormal luteal function following GDF9 immunisation has not been 
previously reported. Given that corpus luteum function is often abnormal in the GDF9 

20 immunised animals (both passive and active immunisations) it is likely that the 
administration of supplementary GDF9 or GDF9B, or GDF9 or GDF9B antagonists 
may modify corpus luteum function. More importantly, the observation that an 
increased ovulation rate effect (ie as seen in heterozygous carriers of these inactivating 
mutations) can also be induced in sheep by short-term active immunisation against 

25 peptides of GDF9 and GDF9B provides new methods for altering ovarian function. 
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TABLE 1 



Sequence variations in GDF9 and GDF9B within the Irish Cambridge and F700 Belclare F700 



flocks 



Gene [original nucl. coding coding mature result 
name] change nucl.(bp) residue residue 



GDF9 [El] G-A 260 87 



Arg (R) - His (H) 



[74] 



C-T 471 



157 



unchanged Val (V) 



[80] 



G-A 477 



159 



unchanged Leu (L) 



10 



[324] G-A 721 241 

[581] A-G 978 326 8 



Glu (E) - Lys (K) 
unchanged Glu (E) 



[597] G-A 994 332 



14 



Val (V) - lie (I) . 



[714] G-A 1111 371 



53 



Val (V) - Met (M) 



[787] C-T 1184 395 77 



Ser (S) - Phe (F) 



GDF9B [Leu] CTTdel 28-30 10 



Leu deletion 



15 



[SI] 



C-T 718 239 



Gin (Q) - STOP 



[422] T-C 747 249 



unchanged Pro (P) 



[S2] 



G-T 1100 367 



99 



Ser (S) -He (I) 



Columns indicate the relationship between (a) the original naming system used for each 
polymorphism, (b) the nucleotide change, (c) the coding nucleotide position (in base pairs 
20 (bp)) in the full length coding sequence numbered from the first atg start codon, (d) the 
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position of the coding amino acid residue involved (starting from the first Met residue), and 
(e) the position of the coding residue (if any) within the mature coding sequence, numbering 
from the first amino acid after the furin protease processing site. For example mutation [581] 
is a G to A nucleotide substitution at coding nucleotide 978 of GDF9 which corresponds to an 
5 unchanged glutamate (Glu) coding residue 326 of the full length unprocessed protein, or 
residue 8 of the processed mature peptide. Polymorphisms associated with infertility and 
ovulation rate traits are in bold. 

* 

TABLE 2. Genotype analysis of nucleotide changes in GDF9B and GDF9 genes from 
Irish Cambridge and F700 Belclare sheep. 

10 Numbers shown are the number of individuals carrying at least one copy of the given 
mutation with the total number of individuals genotyped indicated in brackets 
underneath. Genotypes were determined by specific SNP assay and/or sequencing. 

* The [El] polymorphism in exon 1 of GDF9 (see Table 1) was identified by SSCP 
analysis and was not tested in the same set of animals used for the above table. 



GDF9B 


GDF 9* 




[SI] 


[S2] 


[422] 


[74] 


[80] 


[324] 


[581] 


[597] 


[714] 


[787] 


F700 
Belclares 


9 
(83) 


71 
(86) 


2 
(13) 


6 
(10) 


6 
(10) 


13 
(29) 


0 
(10) 


0 
(10) 


2 
(19) 


11 
(86) 


Cambridge 


74 
(129) 


0 

(131) 


0 

(9) 


0 

(9) 


7 

(9) 


1 

(26) 


3 

(9) 


2 

(9) 


7 
(24) 


95 
(126) 
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TABLE 3. Least squares means for ovulation rate of sheep carrying the different 
genotypes for GDF9 and GDF9B mutations 



Genotype 
SI 


S2 


GDF9 


Breed 

F700 Belclare 


Cambridge 


0 


0 


0 


1.92±0.277(n=ll) 


2.27±0.488 (n = 10) 


0 


0 


1 


2.67±0.895 (n = 1) 


4.39±0.308 (n = 28) 


0 


1 


0 


3.26±0.184 (n = 32) 




0 


1 


1 


6.09±0.549 (n = 3) 




1 


0 


0 


2.69+0.475 (n = 4) 


3.11+0.438 (n = 15) 


1 


0 


1 




5.77±0.270(n = 38) 


Effect of GDF9B [SI] 




0.77±0.537 (P - 0.16) 


1.18±0.387(P<0.01) 


Effect of GDF9B [S2] 




2.38+0.548 (P<0.01) 




Effect of GDF9 [787] 




1.79±0.548 (P<0.01) 


2.35±0.392 (P<0.01) 


Interaction 






-2.08±1.096(P = 0.06) 


-0.55±0.774 (P = 0.41) 



( ) = no. of ewes 

TABLE 4. Proportions of ewes immunized against KLH, GDF9 peptide or GDF9B peptide in 
estrus at the time of first laparoscopy (1 st ), with visible luteal structures at laparoscopy 3-4 
weeks before collection (2 nd ) and at ovarian collection (3rd). 



Immunized Group 


1 st 


2 na 


3 rd 


KLH 


9/9 


9/9 


9/9 


GDF9 peptide 


2/10* 


2/10* 


3/10* 


GDF9B peptide 


1/10* 


1/10* 


1/10* 











*Signifies a value that is different from the control (KLH) value (P<0.05) 
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TABLE 5. Short-term immunisation of sheep with GDF9 or GDF9B 
Treatment Ovulation rate (mean ± sem) 

(n = no. of animals) 

KLH 2.1 ± 0.1 

(14) 

GDF9 peptide 3.1 ± 0.4** 

(7) 

GDF9B peptide 2.8 ± 0.3* 

(6) 

Results are expressed as average ovulation rates ± standard error of the mean (sem). In 
the ovulation rate column asterisks indicate a value that is significantly different than 
the control (KLH) group ( * <0.05, **<0.01) 

Aspects of the present invention have been described by way of example only and it 
should be appreciated that modifications and additions may be made thereto without 
departing from the scope thereof. 
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SEQUENCE LISTING 

<110> Galloway, Susan May 
Davis, George Henry 
Gregan, Scott Michael 
Hanrahan, 3ames Patrick 
uuengal , Jennifer Lee 
McNatty, Kenneth Pattrick 
Mulsant, Philippe 
powel 1 , Ri chard Pat ri ck 

<120> New sequences for altering mammalian ovarian function 

<130> 

<160> 

■ 

<170> paten tin version 3.0 

<210> 1 

<211> 1879 

<212> DNA 

<213> Ovis aries 

<220> 

<221> 5'UTR 
<222> (1) . . (121) 

<220> 

<221> misc_feature 

<222> (122) . . (124) 

<223> atg start codon. 

* 

<220> 
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CDS 

(122) . . (518) 
<220> 

<221> CDS 

<222> (839) . . (1800) 
<220> 

<221> intron 
<222> (519) . . (838) 

<223> n at 709 represents remainder of approx 900 bp unsequenced of the 
approx 1.1 kb intron 

<220> 

<221> mat_peptide 
<222> (1396).. () 

<220> 

<221> mis cofeature 

<222> (709) . . () 

<223> n represents approximately 900 bp of unsequenced intron. 
<220> 

<221> mi sc_f eatu re 

<222> (1801) . . (1803) 

<223> taa stop codon. 

<220> 

<221> 3'UTR 

<222> (1804) . . (1879) 

<220> 

<221> mutation 
<222> (1624) . . (1626) 
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3> c to t at 1625 in [787] sheep changing tct serine codon to ttt ph 
enyl al anine 



<400> 1 

gaattgaacc tagcccaccc acacacctaa agtttattta agagaccaac cgaggctctt 60 

cctggttttt aggaagaaga ctggtatggg gaaatgtgtt ccttgctaat tcttccaagc 120 

c atg gcg ctt ccc aac aaa ttc ttc ctt tgg ttt tgc tgc ttt gcc 166 
Met Ala Leu Pro Asn Lys Phe Phe Leu Trp Phe Cys Cys Phe Ala 

-315 -310 -305 

tgg etc tgt ttt cct att age ctt gat tct ctg cct tct agg gga 211 
Trp Leu cys Phe Pro lie ser Leu Asp ser Leu Pro ser Arg Gly 

-300 -295 -290 



gaa get cag att gta get agg act gcg ttg gaa tct gag get gag 
Glu Ala Gin lie val Ala Arg Thr Ala Leu Glu Ser Glu Ala Glu 

-285 -280 -275 

act tgg tec ttg ctg aac cat tta gqt ggg aga cac aga cct ggt 
Thr Trp ser Leu Leu Asn His Leu Gly Gly Arg His Arg Pro Gly 

-270 -265 ~ -260 



256 



301 



etc ctt tec cct etc tta gag gtt ctg tat gat ggg cac ggg gaa 346 
Leu Leu ser Pro Leu Leu Glu Val Leu Tyr Asp Gly His Gly Glu 

-255 -250 -245 

ccc ccc agg ctg cag cca gat gac aga get ttg cgc tac atg aag 391 
pro Pro Arg Leu Gin Pro Asp Asp Arg Ala Leu Arg Tyr Met Lys 

-240 -235 -230 

agg etc tat aag gca tac get acc aag gag ggg acc cct aaa tec 436 
Arg Leu Tyr Lys Ala Tyr Ala Thr Lys Glu Gly Thr Pro Lys ser 

-225 -220 -215 

aac aga cgc cac etc tac aac act gtt egg etc ttc acc ccc tgt 481 
Asn Arg Arg His Leu Tyr Asn Thr val Arg Leu Phe Thr Pro Cys 

-210 -205 -200 

get cag cac aag cag get cct ggg gac ctg gcg gca g gtgtgtagga 528 
Ala Gin His Lys Gin Ala Pro Gly Asp Leu Ala Ala 

-195 -190 

gcagattggt taatgggtgg agggaagaag aaagaccttt ttgeatttea gttacataaa 588 

ggagttggcc ctgctccttg acttgeattt tactttgeat ggtactcaat atccaaacaa 648 

acctggtgct tgatcttact tactgtttat tcctaatggc ctcatgggtt gatgtaggct 708 

natcccaccc tgacgtttaa ggcttgagaa tgtggggaga aaagggacag aagcacattc 768 

tgaggtactg attccttgat ttgacttcct gttacatatg gcattactgt tggattgttt 828 

ttcttctcag ga acc ttt cca tea gtq gat ctg ctg ttt aac ctg gat 876 

Gly Thr Phe Pro ser val Asp Leu Leu Phe Asn Leu Asp 
-185 -180 -175 

cgt gtt act gtt gtq gaa cat tta ttc aag tea gtc ttg ctg tat 921 
Arg val Thr val Val Glu His Leu Phe Lys Ser val Leu Leu Tyr 

-170 -165 -160 

act ttc aac aac tec att tct ttt ccc ttt cct gtt aaa tgt ata 966 
Thr Phe Asn Asn ser lie ser Phe Pro Phe Pro val Lys Cys lie 

-155 -150 -145 
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aac ctg gtg ata aaa gag cca gag ttt tct age aag act etc 

Asn Leu val lie Lys G~Iu Pro Glu Phe ser ser Lys Thr Leu 
-140 -135 -130 

cct aga get cca tac tea ttt ace tat aac tea cag ttt gaa ttt 
Pro Arg Ala Pro Tyr Ser Phe Thr Tyr Asn Ser Gin Phe Glu Phe 

-125 -120 -115 

aga aag aaa tac aaa tgg atg gag att gat gtg acg get cct ctt 
Arg Lys Lys Tyr Lys Trp Met Glu lie Asp Val Thr Ala Pro Leu 

-110 -105 -100 

gag cct ctg gtg gee tec cac aag agg aat att cac atg tct gta aat 
Glu Pro Leu val Ala ser His Lys Arg Asn lie His Met ser val Asn 

-95 -90 -85 

ttt aca tgt gcg gaa gac cag ctg cag cat cct tea gcg egg gac age 
Phe Thr cys Ala Glu Asp Gin Leu Gin His Pro ser Ala Arg Asp ser 
-80 -75 -70 

ctg ttt aac atg act ctt etc gta gcg ccc tea ctg ctt ttg tat ctg 
Leu Phe Asn Met Thr Leu Leu Val Ala Pro Ser Leu Leu Leu Tyr Leu 
-65 -60 -55 

aac gac aca agt get cag get ttt cac agg tgg cat tec etc cac cct 
Asn Asp Thr Ser Ala Gin Ala Phe His Arg Trp His ser Leu His Pro 
-50 -45 -40 -35 

aaa agg aag cct tea cag gqt cct gac cag aag aga ggg eta tct gec 
Lys Arg Lys Pro ser Gin Gly Pro Asp Gin Lys Arg Gly Leu Ser Ala 
' -30 -25 -20 

tac ccc gtg gga gaa gaa get get gag ggt gta aga teg tec cgt cac 
Tyr Pro Val Gly Glu Glu Ala Ala Glu Gly Val Arg Ser ser Arg His 

-15 -10 -5 

cgc aga gac cag gag agt gee age tct gaa ttg aag aag cct ctg gtt 
Arg Arg Asp Gin Glu Ser Ala Ser Ser Glu Leu Lys Lys Pro Leu Val 
-11 5 10 

cca get tea gtc aat ctg agt gaa tac ttc aaa cag ttt ctt ttt ccc 
Pro Ala Ser Val Asn Leu ser Glu Tyr Phe Lys Gin Phe Leu Phe Pro 
15 20 25 30 

cag aat gaa tgt gag etc cat gac ttt aga ctt age ttt agt cag ctg 
Gin Asn Glu Cys Glu Leu His Asp Phe Arg Leu Ser Phe Ser Gin Leu 

.35 40 45 

aag tgg gac aac tgg att gtg gec cca cac aaa tac aac cct cga tac 
Lys Trp Asp Asn Trp lie val Ala Pro His Lys Tyr Asn Pro Arg Tyr 



50 



55 



60 



tgt aaa ggg gac tgt ccc agg gcg gtc gga cat egg tat gqc ttt ccg 
cys Lys Gly Asp cys Pro Arg Ala Val Gly His Arg Tyr Gly Phe Pro 
65 70 75 

gtt cac ace atg gtg cag aac ate ate cat gag aaa ctt gac tec tea 
val His Thr Met val Gin Asn lie lie His Glu Lys Leu Asp ser ser 
80 85 90 

gtg cca aga cca tec tgt gta cct gee aag tat age cct ttg agt gtt 
val Pro Arg Pro ser cys val Pro Ala Lys Tyr Ser Pro Leu ser val 
95 100 105 110 

ttg gee ate gag cct gat ggc tea ate get tat aaa gaa tat gaa gat 
Leu Ala lie Glu Pro Asp Gly Ser He Ala Tyr Lys Glu Tyr Glu Asp 

115 120 125 



1011 



1056 



1101 



1149 



1197 



1245 



1293 



1341 



1389 



1437 



1485 



1533 



1581 



1629 



1677 



1725 



1773 
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ata gcc act aag tgt acc tgt cgt taacagactc ctgtcaagta 
lie Ala Thr Lys Cys Thr Cys Arg 
130 135 

aaaccatgag tgtcctggcc agtgtaaatg ccgcgcccct gtctatgcct ttgggagga 

<210> 2 

<211> 453 

<212> PRT 

<213> Ovis aries 



<220> 

<221> misc_feature 

<222> (122) . . (124) 

<223> atg start codon. 
<220> 

<221> mi sc_f eatu re 

<222> (709) . . () 

<223> n represents approximately 900 bp of unsequenced intron. 
<220> 

<221> misc_feature 
<222> (1801) . . (1803) 

<223> taa stop codon. 
<400> 2 

Met Ala Leu Pro Asn Lys Phe Phe Leu Trp Phe Cys Cys Phe Ala 

-315 -310 -305 

Trp Leu Cys Phe Pro lie Ser Leu Asp ser Leu Pro ser Arg Gly 

-300 -295 -290 

Glu Ala Gin He val Ala Arg Thr Ala Leu Glu Ser Glu Ala Glu 

-285 -280 -275 

Thr Trp Ser Leu Leu Asn His Leu Gly Gly Arg His Arg Pro Gly 

-270 -265 -260 

Leu Leu ser Pro Leu Leu Glu val Leu Tyr Asp Gly His Gly Glu 

-255 -250 -245 

Pro pro Arg Leu Gin Pro Asp Asp Arg Ala Leu Arg Tyr Met Lys 

-240 -235 -230 

Arg Leu Tyr Lys Ala Tyr Ala Thr Lys Glu Gly Thr Pro Lys Ser 
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220 



-215 



sn Arg Arg His Leu Tyr Asn Thr val Arg Leu Phe Thr Pro Cys 

-210 -205 -200 



Ala Gin His Lys Gin Ala Pro Gly Asp Leu Ala Ala Gly Thr Phe 

-195 -190 -185 

pro ser val Asp Leu Leu Phe Asn Leu Asp Arg val Thr val val 

-180 -175 -170 

Glu His Leu Phe Lys ser val Leu Leu Tyr Thr Phe Asn Asn ser 

-165 -160 -155 

He ser Phe Pro Phe Pro val Lys Cys lie cys Asn Leu val lie 

-150 -145 -140 

Lys Glu Pro Glu Phe ser Ser Lys Thr Leu Pro Arg Ala Pro Tyr 

-135 -130 -125 

Ser Phe Thr Tyr Asn ser Gin Phe Glu Phe Arg Lys Lys Tyr Lys 

-120 -115 -110 

Tro Met Glu lie Asp val Thr Ala Pro Leu Glu Pro Leu val Ala ser 

-105 -100 -95 

His Lys Arg Asn lie His Met Ser val Asn Phe Thr Cys Ala Glu Asp 
-90 -85 -80 

Gin Leu Gin His Pro Ser Ala Arg Asp ser Leu Phe Asn Met Thr Leu 
-75 -70 -65 

Leu Val Ala Pro Ser Leu Leu Leu. Tyr Leu Asn Asp Thr Ser Ala Gin 
-60 -55 -50 -45 

Ala Phe His Arg Trp His ser Leu His Pro Lys Arg Lys Pro ser Gin 

-40 -35 -30 

Gly Pro Asp Gin Lys Arg Gly Leu Ser Ala Tyr Pro val Gly Glu Glu 

K -25 -20 -15 

Ala Ala Glu Gly val Arg ser ser Arg His Arg Arg Asp Gin Glu ser 
-10 -5 -11 

Ala Ser Ser Glu Leu Lys Lys Pro Leu val Pro Ala Ser Val Asn Leu 
5 10 15 20 

Ser Glu Tyr Phe Lys Gin Phe Leu Phe Pro Gin Asn Glu cys Glu Leu 

25 30 35 



His Asp Phe Arg Leu Ser Phe Ser Gin Leu Lys Trp Asp Asn Trp lie 
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40 45 50 

val Ala Pro His Lys Tyr Asn Pro Arg Tyr cys Lys Gly Asp cys Pro 
55 60 65 

Ara Ala val Gly His Arg Tyr Gly Phe pro val His Thr Met val Gin 
70 75 80 

Asn lie He His Glu Lys Leu Asp Ser ser val Pro Arg Pro Ser Cys 
85 90 95 100 

val Pro Ala Lys Tyr ser Pro Leu ser val Leu Ala lie Glu Pro Asp 

105 110 115 

Gly ser lie Ala Tyr Lys Glu Tyr Glu Asp Met lie Ala Thr Lys cys 

120 125 130 

Thr cys Arg 
135 

<210> 3 
<211> 1362 
<212> DNA 
<213> Ovis aries 



<220> 

<221> misc_feature 

<222> (1) • • (3) 

<223> atg start codon 

<220> 

<221> CDS 

<222> (1)..(1359) 

<220> 

<221> mat_peptide 

<222> (955) . . Q 



<220> 

<221> mutation 
<222> (1183) . . (1185) 
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c to t 1184 in [787] sheep changing serine tct codon to ttt pheny 
1 al ani ne 



<220> 
<221> 
<222> 
<223> 



misc_feature 
C1360) . . C1362) 
taa stop codon 



<400> 3 

atg gcg ctt ccc aac aaa ttc ttc ctt tgg ttt tgc tgc ttt gee 

Met Ala Leu Pro Asn Lys Phe Phe Leu Trp Phe Cys Cys Phe Ala 

-315 -310 -305 

tgg etc tgt ttt cct att age ctt gat tct ctg cct tct agg gga 
Tro Leu cys Phe Pro lie Ser Leu Asp Ser Leu Pro ser Arg Gly 

-300 -295 -290 

gaa get cag att gta get agg act gcg ttg gaa tct gag get gag 
Glu Ala Gin He Val,Ala Arg Thr Ala Leu Glu Ser Glu Ala Glu 

-285 -280 -275 

act tgg tec ttg ctg aac cat tta ggt ggg aga cac aga cct ggt 
Thr Trp ser Leu Leu Asn His Leu Gly Gly Arg His Arg Pro Gly 



-270 



y 

-265 



-260 



etc ctt tec cct etc tta gag gtt ctg tat gat ggg cac ggg gaa 
Leu Leu ser pro Leu Leu Glu Val Leu Tyr Asp Gly His Gly Glu 

-255 -250 -245 

ccc ccc agg ctg cag cca gat gac aga get ttg cgc tac atg aag 
pro Pro Arg Leu Gin pro Asp Asp Arg Ala Leu Arg Tyr Met Lys 

-240 -235 -230 

agg etc tat aag gca tac get acc aag gag ggg acc cct aaa tec 
Arg Leu Tyr Lys Ala Tyr Ala Thr Lys Glu Gly Thr Pro Lys ser 

-225 -220 -215 

aac aga cgc cac etc tac aac act gtt egg etc ttc acc ccc tgt 
Asn Arg Arg His Leu Tyr Asn Thr Val Arg Leu Phe Thr Pro Cys 

-210 -205 -200 

get cag cac aag cag get cct ggg gac ctg gcg gca gga acc ttt 
Ala Gin His Lys Gin Ala Pro Gly Asp Leu Ala Ala Gly Thr Phe 



-195 



-190 



-185 



cca tea gtg gat ctg ctg ttt aac ctg gat cgt gtt act gtt gtg 
Pro ser Val Asp Leu Leu Phe Asn Leu Asp Arg val Thr Val val 

-180 -175 -170 

gaa cat tta ttc aag tea gtc ttg ctg tat act ttc aac aac tec 
Glu His Leu Phe Lys Ser val Leu Leu Tyr Thr Phe Asn Asn Ser 

-165 -160 -155 

att tct ttt ccc ttt cct gtt aaa tgt ata tgc aac ctg gtg ata 
lie ser Phe Pro Phe Pro Val Lys cys lie cys Asn Leu Val lie 

-150 -145 -140 

aaa gag cca gag ttt tct age aag act etc cct aga get cca tac 
Lys Glu Pro Glu Phe Ser Ser Lys Thr Leu Pro Arg Ala Pro Tyr 

-135 -130 -125 
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495 
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ttt acc tat aac tea cag ttt gaa ttt aga aag aaa tac aaa 630 
phe Thr Tyr Asn ser Gin Phe Glu Phe Arg Lys Lys Tyr Lys 
-120 -115 -110 



tgg atg gag att gat gtg acg get cct ett gag cct ctg gtg gee tec 
Trp Met Glu lie Asp VaT Thr Ala Pro Leu Glu Pro Leu val Ala ser 

-105 -100 -95 



get get gag ggt gta aga teg tec cgt cac cgc aga gac cag gag agt 
Ala Ala Glu Gly Val Arg ser ser Arg His Arg Arg Asp Gin Glu ser 
-10 -5 " -1 1 



gtg gee eea cac aaa tac aac cct cga tac tgt aaa ggg gac tgt ccc 

Val Ala Pro His Lys Tyr Asn Pro Arg Tyr Cys Lys Gly Asp Cys Pro 

55 60 65 

£99 gcg gtc gga cat egg tat ggc ttt ccg gtt cac acc atg gtg cag 

Arg Ala Val Gly His Arg Tyr Giy Phe Pro val His Thr Met val Gin 

70 75 80 

aac ate ate cat gag aaa ett gac tec tea gtg cca aga cca tec tgt 

Asn He lie His Glu Lys Leu Asp Ser ser Val Pro Arg Pro ser Cys 

85 90 95 100 



678. 



cac aag agg aat att cac atg tct gta aat ttt aca tgt gcg gaa gac 726 
His Lys Arg Asn lie His Met ser val Asn Phe Thr Cys Ala Glu Asp 
-90 -85 -80 

cag ctg cag cat cct tea gcg egg gac age ctg ttt aac atg act ett 774 
Gin Leu Gin His Pro Ser Ala Arg Asp Ser Leu Phe Asn Met Thr Leu 
-75 -70 -65 

etc gta gcg ccc tea ctg ett ttg tat ctg aac gac aca agt get cag 822 
Leu Val Ala Pro ser Leu Leu Leu Tyr Leu Asn Asp Thr ser Ala Gin 
-60 -55 -50 -45 

get ttt cac agg tgg cat tec etc cac cct aaa agg aag cct tea cag 870 
Ala Phe His Arg Trp His Ser Leu His Pro Lys Arg Lys Pro Ser Gin 

-40 -35 -30 

ggt cct gac cag aag aga ggg eta tct gee tac ccc gtg gga gaa gaa 918 
Gly Pro Asp Gin Lys Arg Gly Leu Ser Ala Tyr Pro val Gly Glu Glu 

-25 -20 -15 



966 



gee age tct gaa ttg aag aag cct ctg gtt cca get tea gtc aat ctg 1014 

Ala Ser Ser Glu Leu Lys Lys Pro Leu val Pro Ala ser val Asn Leu 

5 10 15 20 

agt gaa tac ttc aaa cag ttt ett ttt ccc cag aat gaa tgt gag etc 1062 

Ser Glu Tyr Phe Lys Gin Phe Leu Phe Pro Gin Asn Glu cys Glu Leu 

25 30 35 

cat gac ttt aga ett age ttt agt cag ctg aag tgg gac aac tgg att 1110 

His Asp Phe Arg Leu Ser Phe ser Gin Leu Lys Trp Asp Asn Trp lie 

40 45 50 



1158 



1206 



1254 



gta cct gee aag tat age cct ttg agt gtt ttg gee ate gag cct gat 1302 
Val Pro Ala Lys Tyr Ser Pro Leu Ser Val Leu Ala lie Glu Pro Asp 

105 110 115 

ggc tea ate get tat aaa gaa tat gaa gat atg ata gec act aag tgt 1350 
Gly ser lie Ala Tyr Lys Glu Tyr Glu Asp Met lie Ala Thr Lys Cys 

120 125 130 

acc tgt cgt taa 1362 
Thr Cys Arg 
135 
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Seamus,ST25 



0> 4 

<211> 453 

<212> PRT 

<213> Ovis aries 



<220> 

<221> misc_feature 
<222> (1) . . C3) 
<223> atg start codon. 
<220> 

<221> misc_feature 
<222> (1360) .. (1362) 
<223> taa stop codon. 
<400> 4 

Met Ala Leu pro Asn Lys Phe Phe Leu Trp Phe Cys Cys Phe Ala 

-315 -310 -305 

Trp Leu Cys Phe Pro He ser Leu Asp ser Leu Pro ser Arg Gly 

-300 -295 -290 

Glu Ala Gin lie Val Ala Arg Thr Ala Leu Glu Ser Glu Ala Glu 

-285 -280 -275 

Thr Trp ser Leu Leu Asn His Leu Gly Gly Arg His Arg Pro Gly 

-270 -265 -260 

Leu Leu Ser Pro Leu Leu Glu val Leu Tyr Asp Gly His Gly Glu 

-255 -250 -245 

Pro Pro Arg Leu Gin Pro Asp Asp Arg Ala Leu Arg Tyr Met Lys 

-240 -235 -230 

Arg Leu Tyr Lys Ala Tyr Ala Thr Lys Glu Gly Thr Pro Lys ser 

-225 -220 -215 

Asn Arg Arg His Leu Tyr Asn Thr Val Arg Leu Phe Thr Pro cys 

-210 -205 -200 

Ala Gin His Lys Gin Ala Pro Gly Asp Leu Ala Ala Gly Thr Phe 

-195 -190 -185 

Pro ser Val Asp Leu Leu Phe Asn Leu Asp Arg val Thr Val val 

-180 -175 -170 
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His Leu phe Lys Ser val Leu Leu Tyr Thr Phe Asn Asn ser 
-165 -160 -155 



lie ser Phe Pro Phe Pro val Lys Cys lie Cys Asn Leu val lie 

-150 -145 -140 

Lys Glu Pro Glu Phe Ser Ser Lys Thr Leu Pro Arg Ala Pro Tyr 

-135 -130 -125 

Ser Phe Thr Tyr Asn Ser Gin Phe Glu Phe Arg Lys Lys Tyr Lys 

-120 -115 -110 

Trp Met Glu lie Asp Val Thr Ala Pro Leu Glu Pro Leu Val Ala Ser 

r-105 -100 -95 

His Lys Arg Asn lie His Met Ser val Asn Phe Thr cys Ala Glu Asp 
-90 -85 -80 

Gin Leu Gin His Pro ser Ala Arg Asp ser Leu Phe Asn Met Thr Leu 
-75 -70 -65 

Leu Val Ala Pro Ser Leu Leu Leu Tyr Leu Asn Asp Thr Ser Ala Gin 
-60 -55 -50 -45 

Ala Phe His Arg Trp His Ser Leu His Pro Lys Arg Lys Pro Ser Gin 

-40 -35 -30 

Gly Pro Asp Gin Lys Arg Gly Leu ser Ala Tyr Pro val Gly Glu Glu 

-25 -20 -15 

Ala Ala Glu Gly val Arg ser ser Arg His Arg Arg Asp Gin Glu ser 
-10 -5 -11 

Ala ser Ser Glu Leu Lys Lys Pro Leu val Pro Ala ser val Asn Leu 
5 10 15 20 

Ser Glu Tyr Phe Lys Gin Phe Leu Phe Pro Gin Asn Glu cys Glu Leu 

25 30 35 

His Asp Phe Arg Leu Ser Phe Ser Gin Leu Lys Trp Asp Asn Trp lie 

40 45 50 

Val Ala Pro His Lys Tyr Asn Pro Arg Tyr cys Lys Gly Asp Cys Pro 
55 60 65 

Arg Ala val Gly His Arg Tyr Gly Phe Pro Val His Thr Met val Gin 
70 75 80 



Asn lie lie His Glu Lys Leu Asp Ser ser val Pro Arg Pro Ser Cys 
85 90 95 100 
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Pro Ala Lys Tyr ser Pro Leu ser val Leu Ala lie Glu Pro Asp 

105 110 115 

Gly ser lie Ala Tyr Lys Glu Tyr Glu Asp Met lie Ala Thr Lys cys 

120 125 130 

— ■ 

Thr Cys Arg 
135 

<210> 5 
<211> 168 
<212> DNA 
<213> Ovis aries 



<220> 

<221> CDS 

<222> (1) . . (168) 

<220> 

<221> mutation 

<222> (82) . . (84) 

<223> c to t at 83 in [787] sheep changing tct serine codon to ttt phen 
ylal anine 



<400> 5 

aac tgg att gtq gcc cca cac aaa tac aac cct cga tac tgt aaa gqg 48 

Asn Trp lie Val Ala Pro His Lys Tyr Asn pro Arg Tyr cys Lys Giy 
1 5 10 ~ 15 



gac tgt ccc agg gcg gtc gqa cat egg tat ggc ttt ccg gtt cac acc 
Asp cys Pro Arg Ala Val Gly His Arg Tyr Gly Phe Pro Val His Thr 

20 25 30 



96 



atg gtq cag aac ate ate cat gag aaa ctt gac tec tea gtq cca aga 144 
Met Val Gin Asn lie lie His Glu Lys Leu Asp Ser Ser Val Pro Arg 
35 40 45 

cca tec tgt gta cct gcc aag tat 168 
Pro ser Cys Val pro Ala Lys Tyr 
50 55 

<210> 6 
<211> 56 
<212> PRT 
<213> ovis aries 
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Val Ala Pro His Lys Tyr Asn Pro Arg Tyr Cys Lys Gly 
5 10 15 

• 

Arg Ala val Gly His Arg Tyr Gly Phe Pro val His Thr 
20 25 30 

Met val Gin Asn lie lie His Glu Lys Leu Asp Ser Ser val Pro Arg 
35 40 45 

Pro ser cys val Pro Ala Lys Tyr 
50 55 

<210> 7 

<211> 1665 

<212> DNA 

<213> Ovis aries 

<220> 

<221> 5'UTR 
<222> CD - - (252) 

<220> 

<221> misc_feature 

<222> (253).. (255) 

<223> atg start codon. 

<220> 

<221> CDS 

<222> (253).. (577) 

<220> 

<221> CDS 

<222> (774) . . (1165) 




<400> 6 



Asn Trp lie 
1 



Asp cys Pro 



<220> 

<221> mtron 
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l2> C578) . . (773) 
<223> n at 685 represents remainder of approx 5.2 kb intron 




<220> 

<221> mis cofeature 

<222> (1253).. (1255) 

<223> position of first codon of mature peptide in wildtype sheep . 



<220> 

<221> mis cofeature 

<222> (685) . . () 

<223> n represents approx 5,2 kb of intron. 
<220> 

<221> misc_feature 

<222> (1628) .. (1630) 

<223> tga stop codon in wildtype sheep. 



<220> 

<221> 3'UTR 

<222> (1631) . . (1665) 



( <220> 

<221> mutation 
<222> (1166) . . (1168) 

<223> c to t at 1166 of [Si] sheep changes cag glutamine codon to tag S 
TOP 



<400> 7 

catgctgcct tgtcccacct gctgtttctg tttgtttgat gcaaagagga caatttagaa 60 

gacctctttt tggttcagga gatcctacca gaggaagaaa cataggacct gcctgccagc 120 

ctttcatttt tccttgccct atcctttgtg gtagtggagc ctggatgctg ttacccatgt 180 

aaaaggaaag gtttaaagcg ttatcctttg ggcttttatc agaacatgtt gctgaacacc 240 

aagcttttca ag atg gtc etc ctg age ate ctt aga ate ctt ctt tgg gga 291 

Met val Leu Leu Ser He Leu Arg lie Leu Leu Trp Gly 
15 10 
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gtq ctt ttt atg gaa cat agg gtc caa atg aca cag gta gqg cag 339 

Val Leu Phe Met Glu His Arg Val Gin Met Thr Gin val Gly Gin 

15 20 25 

ccc tct att gcc cac ctg cct gag gcc cct acc ttg ccc ctg att cag 387 

Pro ser lie Ala His Leu Pro Glu Ala pro Thr Leu Pro Leu lie Gin 
30 35 40 45 



gag ctg eta gaa gaa gcc cct gqc aag cag cag agg aag ccg egg gtc 
Glu Leu Leu Glu Glu Ala Pro Gly Lys Gin Gin Arg Lys Pro Arg val 

50 55 60 



435 



tta gqg cat ccc tta egg tat atg ctg gag ctg tac cag cgt tea get 483 
Leu Gly His Pro Leu Arg. Tyr Met Leu Glu Leu Tyr Gin Arg ser Ala 

65 70 75 

gac gca agt gga cac cct agg gaa aac cgc acc att gqg gcc acc atg 531 
Asp Ala Ser Gly His Pro Arg Glu Asn Arg Thr lie Gly Ala Thr Met 
80 85 90 

gtq agg ctg gtq agg ccg ctg get agt gta gca agg cct etc aga g 577 
val Arg Leu val Arg Pro Leu Ala ser val Ala Arg Pro Leu Arg 
95 100 105 

gtgagttatc atactatatt gttctggtgg gaggggggga gaaaatgggg aagaaaagtg 637 

tagaaaaaag tggatctgtc agttttctgt caggcttcac attgectnea gtttgtactg 697 

agcaggtctg ttagagagac taaggctagg atataagaag etaaegcttt gctcttgttc 757 

cctcttacta atgcag gc tec tgg cac ata cag acc ctg gac ttt cct ctg 808 

Gly ser Trp His lie Gin Thr Leu Asp Phe Pro Leu 
110 115 120 

aga cca aac egg gta gca tac caa eta gtc aga gcc act gtq gtt tac 856 
Arg Pro Asn Arg val Ala Tyr Gin Leu val Arg Ala Thr Val Val Tyr 

125 130 135 

cgc cat cag ctt cac eta act cat tec cac etc tec tgc cat gtq gag 904 
Arg His Gin Leu His Leu Thr His ser His Leu ser cys His Val Glu 

140 145 150 

ccc tgg gtc cag aaa age cca acc aat cac ttt cct tct tea gga aga 952 
Pro Trp Val Gin Lys ser Pro Thr Asn His Phe Pro ser ser Gly Arg 
155 160 165 

ggc tec tea aag cct tec ctg ttg ccc aaa act tgg aca gag atg gat 1000 
Gly ser ser Lys Pro ser Leu Leu Pro Lys Thr Trp Thr Glu Met Asp 
170 175 180 

ate atg gaa cat gtt gqg caa aag etc tgg aat cac aag gqg cgc agg 
He Met Glu His Val Gly Gin Lys Leu Trp Asn His Lys Gly Arg Arg 
185 190 195 200 

gtt eta cga etc cgc ttc gtq tgt cag cag cca aga ggt agt gag gtt 1096 
Val Leu Arg Leu Arg Phe val cys Gin Gin Pro Arg Gly ser Glu Val 

205 210 215 

ctt gag ttc tgg tgg cat gqc act tea tea ttg gac act gtc ttc ttg 1144 
Leu Glu Phe Trp Trp His Gly Thr ser Ser Leu Asp Thr Val Phe Leu 

220 225 230 

tta ctg tat ttc aat gac act tagagtgttc agaagaccaa acctctccct 1195 
Leu Leu Tyr Phe Asn Asp Thr 
235 

aaaggcctga aagagtttac agaaaaagac ccttctcttc tcttgaggag ggctcgtcaa 1255 
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ggcagta ttgcatcgga agttcctggc ccctccaggg agcatgatgg gcctgaaagt 1315 

aaccagtgtt ccctccaccc ttttcaagtc agcttccagc agctgggctg ggatcactgg 1375 

atcattgctc cccatctcta taccccaaac tactgtaagg gagtatgtcc tcgggtacta 1435 

cactatggtc tcaattctcc caatcatgcc atcatccaga accttgtcag tgagctggtg 1495 

gatcagaatg tccctcagcc ttcctgtgtc ccttataagt atgttcccat tagcatcctt 15 55 

ctgattgagg caaatgggag tatcttgtac aaggagtatg agggtatgat tgcccagtcc 1615 

tgcacatgca ggtgacggca aaggtgcagc tagctcaggt ttcccaagaa 1665 

<210> 8 

<211> 239 

<212> PRT 

( <213> ovis aries 



<220> 

<221> nrisc_feature 

<222> (253).. (255) 

<223> atg start coclon. 
<220> 

<221> misc_feature 

<222> (1253). .(1255) 

<223> position of first codon of mature peptide in wildtype sheep. 
<220> 

<223> misc_feature 

<222> (685) . . () 

<223> n represents approx 5.2 kb of intron. 
*<220> 

<221> mi sc_f eature 

<222> (1628) . . (1630) 

<223> tga stop codon in wildtype sheep. 

<400> 8 

- Met Val Leu Leu ser lie Leu Arg lie Leu Leu Trp Gly Leu Val Leu 
15 10 15 

Phe Met Glu His Arg Val Gin Met Thr Gin Val Gly Gin Pro Ser lie 

20 25 30 
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His Leu Pro Glu Ala Pro Thr Leu Pro Leu lie Gin Glu Leu Leu 
35 40 45 

Glu Glu Ala Pro Gly Lys Gin Gin Arg Lys Pro Arg Val Leu Gly His 
50 55 60 

pro Leu Arg Tyr Met Leu Glu Leu Tyr Gin Arg ser Ala Asp Ala Ser 
65 70 75 80 

« 

Gly His Pro Arg Glu Asn Arg Thr lie Gly Ala Thr Met val Arg Leu 

85 90 95 

Val Arg Pro Leu Ala Ser val Ala Arg Pro Leu Arg Gly ser Trp His 

100 105 110 

lie Gin Thr Leu Asp Phe Pro Leu Arg pro Asn Arg val Ala Tyr Gin 
115 120 125 

* 

Leu val Arg Ala Thr val val Tyr Arg His Gin Leu His Leu Thr His 
130 135 140 

ser His Leu ser cys His val Glu Pro Trp val Gin Lys ser Pro Thr 
145 150 155 160 

Asn His Phe pro ser ser Gly Arg Gly ser ser Lys Pro Ser Leu Leu 

165 170 175 

Pro Lys Thr Trp Thr Glu Met Asp lie Met Glu His Val Gly Gin Lys 

180 185 190 

Leu Trp Asn His Lys Gly Arg Arg val Leu Arg Leu Arg Phe val cys 
195 200 205 

Gin Gin pro Arg Gly ser Glu val Leu Glu Phe Trp Trp His Gly Thr 
210 215 220 

Ser ser Leu Asp Thr val Phe Leu Leu Leu Tyr Phe Asn Asp Thr 
225 230 235 

<210> 9 

<211> 1182 

<212> DNA 

<213> ovis areis 



<220> 

<221> misc_feature 
<222> (1) . . (3) 
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3> atg start codon. 



<220> 
<221> 
<222> 
<223> 



seamus.ST25 



mutati on 
(718) . . (720) 

c to t at 718 of [Si] sheep changes cag glutamine codon to tag s 

TOP. 



<220> 

<221> CDS 

<222> (1) . . (717) 



<220> 

<221> misc_feature 

<222> (805) . . (807) 

<223> first codon of mature peptide in wildtype sheep 
<220> 

<221> misc_feature 

<222> (1180) . . (1182) 

<223> tga stop codon. 



( 



<400> 9 

atg gtc etc ctg age ate ctt aga ate ctt ctt tgg gga ctg gtg ctt 

Met val Leu Leu ser He Leu Arg He Leu Leu Trp Gly Leu VaT Leu 

1 5 10 15 

ttt atg gaa cat agg gtc caa atg aca cag gta ggg cag ccc tct att 

Phe Met Glu His Arg val Gin Met Thr Gin val Gly Gin Pro Ser lie 

20 25 30 

gec cac ctg cct gag gec cct ace ttg ccc ctg att cag gag ctg eta 

Ala His Leu Pro Glu Ala Pro Thr Leu Pro Leu lie Gin Glu Leu Leu 

35 40 45 

gaa gaa gec cct ggc aag cag cag agg aag ccg egg gtc tta ggg cat 

Glu Glu Ala Pro Gly Lys Gin Gin Arg Lys Pro Arg val Leu Gly His 

60 



50 



55 



ccc tta egg tat atg ctg gag ctg tac cag cgt tea get gac gca agt 
Pro Leu Arg Tyr Met Leu Glu Leu Tyr Gin Arg Ser Ala Asp Ala Ser 
65 70 75 80 

gga cac cct agg gaa aac cgc acc att ggg gec acc atg gtg agg ctg 
Gly His Pro Arg Glu Asn Arg Thr lie Gly Ala Thr Met Val Arg Leu 

85 90 95 
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agg ccg ctg get agt gta gca agg cct etc aga ggc tec tgg cac 336 

Arg Pro Leu Ala Ser Val Ala Arg Pro Leu Arg giv Ser Trp His 
100 105 110 

ata cag ace ctg gac ttt cct ctg aga cca aac egg gta gca tac caa 384 
He Gin Thr Leu Asp Phe pro Leu Arg pro Asn Arg Val Ala Tyr Gin 
115 120 125 

eta gtc aga gee act gtg gtt tac cgc cat cag ctt cac eta act cat 432 
Leu val Arg Ala Thr val val Tyr Arg His Gin Leu His Leu Thr His 
130 135 140 

tec cac etc tec tgc cat gtg gag ccc tgg gtc cag aaa age cca ace 480 
ser His Leu Ser Cys His Val Glu Pro Trp val Gin Lys ser Pro Thr 
145 150 155 160 

aat cac ttt cct tct tea gga aga ggc tec tea aag cct tec ctg ttg 
Asn His Phe Pro ser ser Gly Arg Gly ser Ser Lys Pro ser Leu Leu 

165 170 175 

ccc aaa act tgg aca gag atg gat ate atg gaa cat gtt ggg caa aag 576 
Pro Lys Thr Trp Thr Glu Met Asp lie Met Glu His val Gly Gin Lys 

180 185 190 

etc tgg aat cac aag ggg cgc agg gtt eta cga etc cgc ttc gtg tgt 624 
Leu Trp Asn His Lys Gly Arg Arg Val Leu Arg Leu Arg Phe Val Cys 
195 200 205 

cag cag cca aga gqt agt gag gtt ctt gag ttc tgg tgg cat ggc act 672 
Gin Gin Pro Arg Gly Ser Glu Val Leu Glu Phe Trp Trp His Gly Thr 
210 215 220 



<210> 10 

<211> 239 

<212> prt 

<213> ovis areis 

<220> 

<221> misc_feature 

<222> (1) . . (3) 



528 



tea tea ttg gac act gtc ttc ttg tta ctg tat 
ser ser Leu Asp Thr val Phe Leu Leu Leu Tyr 
225 230 235 


ttc aat gac act 
Phe Asn Asp Thr 


717 


tagagtgttc 


agaagaccaa acctctccct 


aaaggcctga 


aagagtttac 


agaaaaagac 


777 


ccttctcttc 


tcttgaggag ggctcgtcaa 


gcaggcagta 


ttgcatcgga 


agttcctggc 


837 


ccctccaggg 


agcatgatgg gectgaaagt 


aaccagtgtt 


ccctccaccc 


ttttcaagtc 


897 


agcttccagc 


agctgggctg ggatcactgg 


ateattgetc 


cccatctcta 


taccccaaac 


957 


tactgtaagg 


gagtatgtcc tegggtacta 


cactatggtc 


tcaattctcc 


caatcatgcc 


1017 


atcatccaga 


accttgtcag tgagctggtg 


gatcagaatg 


tccctcagcc 


ttcctgtgtc 


1077 


ccttataagt 


atgttcccat tagcatcctt 


ctgattgagg 


caaatgggag 


tatcttgtac 


1137 


aaggagtatg 


agggtatgat tgcccagtcc 


tgeacatgea 


ggtga 




1182 
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* 

> atg start codon. 
20> 

<221> miscfeature 
<222> (805) . . (807) 

<223> first codon of mature peptide in wildtype sheep. 
<220> 

4 

<221> misc_feature 
<222> (1180) . . (1182) 
<223> tga stop codon. 
<400> 10 

Met Val Leu Leu ser lie Leu Arg lie Leu Leu Trp Gly Leu val Leu 
15 10 15 

Phe Met Glu His Arg Val Gin Met Thr Gin val Gly Gin Pro Ser lie 

20 , 25 30 

Ala His Leu Pro Glu Ala Pro Thr Leu Pro Leu lie Gin Glu Leu Leu 
35 40 45 

Glu Glu Ala Pro Gly Lys Gin Gin Arg Lys Pro Arg val Leu Gly His 
50 55 60 

Pro Leu Arg Tyr Met Leu Glu Leu Tyr Gin Arg ser Ala Asp Ala Ser 
65 70 75 80 

Gly His Pro Arg Glu Asn Arg Thr lie Gly Ala Thr Met Val Arg Leu 

85 90 95 

Val Arg Pro Leu Ala Ser Val Ala Arg Pro Leu Arg Gly ser Trp His 

100 105 110 

lie Gin Thr Leu Asp Phe Pro Leu Arg Pro Asn Arg Val Ala Tyr Gin 
115 120 125 

Leu val Arg Ala Thr val Val Tyr Arg His Gin Leu His Leu Thr His 
130 135 140 

Ser His Leu ser cys His Val Glu Pro Trp Val Gin Lys ser Pro Thr 
145 150 155 160 

Asn His Phe Pro Ser ser Gly Arg Gly ser ser Lys Pro Ser Leu Leu 

165 170 175 

pro Lys Thr Trp Thr Glu Met Asp lie Met Glu His val Gly Gin Lys 

180 185 190 
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Trp Asn His Lys Gly Arg Arg val Leu Arg Leu Arg Phe val cys 
195 200 ~ 205 

Gin Gin Pro Arg Gly ser Glu val Leu Glu Phe Trp Trp His Gly Thr 
210 215 220 

Ser Ser Leu Asp Thr Val Phe Leu Leu Leu Tyr Phe Asn Asp Thr 
225 230 235 

<210> 11 

<211> 168 

<212> DNA 

<213> Ovis aries 



<220> 

<221> CDS 

<222> (D..C84) 

<220> 

<221> mutation 

<222> C85) . . C87) 

<223> c to t at 85 of [si] sheep changes glutamine cag codon to tag STO 
P 



<400> 11 

aga ggt agt gag gtt ctt gag ttc tgg tgg cat ggc act tea tea ttg 48 

Arg Gly Ser Glu Val Leu Glu Phe Trp Trp His Gly Thr ser ser Leu 
1 5 10 15 

gac act gtc ttc ttg tta ctg tat ttc aat gac act tagagtgttc 94 

Asp Thr val Phe Leu Leu Leu Tyr Phe Asn Asp Thr 

20 25 

agaagaccaa acctctccct aaaggcctga aagagtttac agaaaaagac ccttctcttc 154 

tcttgaggag ggct 168 



<210> 


12 


<211> 


28 


<212> 


PRT 


<213> 


Ovis aries 



<400> 12 
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Gly Ser Glu Val Leu Glu Phe Trp Trp His Gly Thr Ser Ser Leu 

5 10 ' 15 



20 25 



Asp Thr val 


<210> 


13 


<211> 


1665 


<212> 


DNA 


<213> 


ovis 



<220> 

<221> 5'UTR 
<222> (1)..(252) 



<220> 

<221> misc_feature 

<222> (253).. (255) 

<223> atg start codon 



<220> 

<221> CDS 

<222> (253).. (577) 



<220> 

<221> CDS 

<222> (774) . . (1627) 

<220> 

<221> intron 

<222> (578) . . (773) 

<223> n at 685 represents approx 5.2 kb intron. 



<220> 

<221> mat_peptide 
<222> (1253).. () 
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<221> misc_feature 
<222> (685).. O 

<223> n represents approx 5.2 kb of intron 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(1628) . . (1630) 
tga stop codon 



( <220> 

<221> 3'UTR 

<222> (1628) . . (1665) 



<220> 

<221> mutation 



<222> 
<223> 



(1547).. (1549) 

eucine codon 48 ° f '" S2 "' sheep chan 9 es a 9 c serine codon to ate isol 



<400> 13 
catgctgcct 

gacctctttt 

ctttcatttt 

aaaaggaaag 

aagcttttca 



tgtcccacct 

tggttcagga 

tccttgccct 

gtttaaagcg 

ag atg gtc 
Met val 



gctgtttctg tttgtttgat gcaaagagga caatttagaa 

gatcctacca gaggaagaaa cataggacct gcctgccagc 

atcctttgtg gtagtggagc ctggatgctg ttacccatgt 

ttatcctttg ggcttttatc agaacatgtt gctgaacacc 

etc ctg age ate ctt aga ate ctt ctt tgg 
Leu Leu ser lie Leu Arg lie Leu Leu Tro 
-265 -260 



P?2 ?l 9 8 ? ? Zt IP atg g ? a c ? t agg 9 tc caa atg aca cag gta 
Gly Leu val Leu Phe Met Glu His Arg val Gin Met Thr Gin val 
"255 -250 -245 

999 cag ccc tct att gee cac ctg cct gag gee cct acc ttg ccc 
Gly Gin Pro ser lie Ala His Leu Pro Glu Ala Pro Thr Leu Pro 
~ 240 -235 -230 

?1 Q St* S? g g ? g ctg cta gaa 9 aa 9 CC cct 99^ aag cag cag agg 

-225 L6U L6U GlU a Pr ° ^ y Ly * Gl " Gl " ^ g 

22^) ~215 



aag ccg egg gtc tta ggg cat 
Lys pro Arg val Leu Gly His 



ccc tta egg tat atg 
Pro Leu Arg Tyr Met 
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ctg gag ctg 
Leu Glu Leu 



60 
120 
180 
240 
288 



333 



378 



423 



468 




seamus.ST25 

-210 -205 -200 

» 

^ cag cgt tea get gac gca agt gga cac cct agg gaa aac cgc 513 
Tyr Gin Arg ser Ala Asp Ala ser Gly His pro Arg Glu Asn Arg 
-195 -190 -185 

acc att ggg gec ace atg gtg agg ctg gtg agg ccg ctg get agt 558 
Thr He Gly Ala Thr Met val Arg Leu val Arg Pro Leu Ala ser 
-180 -175 -170 

gta gca agg cct etc aga g gtgagttatc atactatatt gttctggtgg 607 
Val Ala Arg Pro Leu Arg 
-165 

gaggggggga gaaaatgggg aagaaaagtg tagaaaaaag tggatctgtc agttttctgt 667 

caggcttcac attgectnea gtttgtactg agcaggtctg ttagagagac taaggctagg 727 

atataagaag etaaegcttt gctcttgttc cctcttacta atgcag gc tec tgg 781 

Gly ser Trp 
-160 

cac ata cag acc ctg gac ttt cct ctg aga cca aac egg gta gca 826 
hts lie Gin Thr Leu Asp Phe Pro Leu Arg Pro Asn Arg val Ala 
-155 -150 -145 

tac caa eta gtc aga gec act gtg gtt tac cgc cat cag ctt cac ' 871 
Tyr Gin Leu Val Arg Ala Thr val val Tyr Arg His Gin Leu His 
-140 -135 -130 

eta act cat tec cac etc tec tgc cat gtg gag ccc tgg gtc cag 916 
Leu Thr His ser His Leu Ser Cys His Val Glu Pro Trp val Gin 
-125 -120 -115 

aaa age cca acc aat cac ttt cct tct tea gga aga ggc tec tea 961 
Lys ser Pro Thr Asn His Phe Pro Ser ser Gly Arg Gly ser ser 
-110 -105 -100 



aag 
Lys 


cct 
Pro 


tec 
ser 
-95 


ctg 
Leu 


ttg 
Leu 


ccc 
Pro 


aaa act tgg aca 
Lys Thr Trp Thr 
-90 


gag 

Glu 


atg 
Met 


gat 
Asp 
-85 


ate 
lie 


atg 
Met 


gaa 
Glu 


1009 


cat 
His 


gtt 
val 
-80 


ggg 

Gly 


caa 
Gin 


aag 
Lys 


etc 
Leu 


tgg aat cac 
Trp Asn His 
-75 


aag 
Lys 


ggg 

Gly 


cgc 
Arg 
-70 


agg 
Arg 


gtt 
val 


eta 
Leu 


cga 
Arg 


1057 


etc 
Leu 
-65 


cgc 
Arg 


ttc 
Phe 


gtg 
Val 


tgt 
cys 


cag 
Gin 
-60 


cag cca aga ggt 
Gin Pro Arg Gly 


agt 
Ser 
-55 


gag 
Glu 


gtt 

val 


ctt 
Leu 


gag 

Glu 


ttc 
Phe 
-50 


1105 


tgg 
Trp 


tgg 

Trp 


cat 
His 


ggc 

Gly 


act 
Thr 
-45 


tea 
Ser 


tea ttg gac act 
Ser Leu Asp Thr 

-40 


gtc 

val 


ttc 
Phe 


ttg 
Leu 


tta 
Leu 


ctg 
Leu 
-35 


tat 
Tyr 


1153 


ttc 
Phe 


aat 
Asn 


gac 
Asp 


act 
Thr 
-30 


cag 
Gin 


agt 
Ser 


gtt cag aag 
val Gin Lys 
-25 


acc 
Thr 


aaa 
Lys 


cct 
Pro 


etc 
Leu 


cct 
Pro 
-20 


aaa 
Lys 


ggc 

Gly 


1201 


ctg 
Leu 


aaa 
Lys 


gag 

Glu 

-15 


ttt 
Phe 


aca 
Thr 


gaa 
Glu 


aaa gac cct 
Lys Asp Pro 
-10 


tct 
ser 


ctt 
Leu 


etc 
Leu 


ttg 
Leu 
-5 


agg 
Arg 


agg 
Arg 


get 
Ala 


1249 


cgt 
Arg 


caa 
Gin 
1 


gca 
Ala 


gg c 

Gly 


agt 
Ser 


att 
He 
5 


gca teg gaa 
Ala Ser Glu 


gtt 
val 


cct 
Pro 
10 


ggc 

Gly 


ccc 
Pro 


tec agg 
Ser Arg 


gag 
Glu 
15 


1297 


cat 
His 


gat 
Asp 


ggg 

Gly 


cct 
Pro 


gaa 
Glu 


agt 
Ser 


aac cag tgt 
Asn Gin cys 


tec 
Ser 


etc 
Leu 


cac 
His 


cct 
Pro 


ttt 
Phe 


caa 
Gin 


gtc 

val 


1345 
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Seamus.ST25 
25 



30 



ttc cag cag ctg gqc tgg gat cac tgg ate att get ccc cat etc 
Ser Phe Gin Gin Leu Gly Trp ASp His Trp lie lie Ala Pro His Leu 

35 40 45 

tat ace cca aac tac tgt aag gga gta tgt cct egg gta eta cac tat 
Tyr Thr Pro Asn Tyr Cys Lys Gly. val cys Pro Arg val Leu His Tyr 
50 55 60 



gqt etc aat tct ccc aat cat gec ate ate cag aac ctt gtc 

Gly Leu Asn Ser Pro Asn His Ala lie He Gin Asn Leu val 
65 70 75 

ctg gtg gat cag aat gtc cct cag cct tec tgt gtc cct tat 

Leu val Asp Gin Asn val Pro Gin pro ser Cys val Pro Tyr 
80 85 90 

gtt ccc att ate ate ctt ctg att gag gca aat ggg agt ate 

val Pro lie lie He Leu Leu lie Glu Ala Asn Gly Ser lie 

100 105 

aag gag tat gag ggt atg att gec cag tec tgc aca tgc agg 

Lys Glu Tyr Glu Gly Met lie Ala Gin ser cys Thr Cys Arg 

■115 120 125 

tgacggcaaa ggtgcagcta gctcaggttt cccaagaa 



agt gag 
Ser Glu 



aag tat 
Lys Tyr 
95 

ttg tac 
Leu Tyr 
110 



1393 



1441 



1489 



1537 



1585 



1627 



1665 



<210> 14 

<211> 393 

<212> PRT 

<213> Ovis aries 



<220> 
<221> 
<222> 
<223> 
<220> 
<221> 
<222> 
<223> 
<220> 
<223> 
<222> 
<223> 
<400> 



mise_feature 
(253).. (255) 
atg start codon. 

misc_feature 
(685) . . () 

n represents approx 5.2 kb of intron 

misc_feature 
(1628) . . (1630) 
tga stop codon. 
14 



Met val Leu Leu ser lie Leu Arg He Leu Leu Trp Gly Leu val 

-265 -260 -255 
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Phe Met Glu His Arg val Gin Met Thr Gin val Gly Gin Pro 
-250 -245 -240 



Ser lie Ala His Leu Pro Glu Ala Pro Thr Leu Pro Leu lie Gin 

-235 -230 -225 

Glu Leu Leu Glu Glu Ala Pro Gly Lys Gin Gin Arg Lys Pro Arg 

-220 -215 -210 

Val Leu Gly His Pro Leu Arg Tyr Met Leu Glu Leu Tyr Gin Arg 

-205 -200 -195 

ser Ala Asp Ala ser Gly His Pro Arg Glu Asn Arg Thr lie Gly 

• -190 -185 -180 

Ala Thr Met val Arg Leu val Arg Pro Leu Ala Ser Val Ala Arg 

-175 -170 -165 

Pro Leu Arg Gly Ser Trp His lie Gin Thr Leu Asp Phe Pro Leu 

-160 ' -155 -150 

Arg Pro Asn Arg Val Ala Tyr Gin Leu Val Arg Ala Thr val Val 

-145 -140 -135 

■ 

Tyr Arg His Gin Leu His Leu Thr His Ser His Leu Ser Cys His 

-130 -125 -120 

val Glu pro Trp val Gin Lys ser Pro Thr Asn His Phe Pro ser • 

-li5 -110 -105 

ser Gly Arg Gly ser Ser Lys Pro ser Leu Leu Pro Lys Thr Trp Thr 

-100 -95 -90 

Glu Met Asp. lie Met Glu His val Gly Gin Lys Leu Trp Asn His Lys 
-85 ' -80 -75 

Gly Arg Arg Val Leu Arg Leu Arg Phe Val cys Gin Gin Pro Arg Gly 
-70 -65 -60 

» 

ser Glu Val Leu Glu Phe Trp Trp His Gly Thr ser Ser Leu Asp Thr 
-55 -50 -45 -40 

Val Phe Leu Leu Leu Tyr Phe Asn Asp Thr Gin Ser val Gin Lys Thr 

-35 -30 -25 

Lys Pro Leu Pro Lys Gly Leu Lys Glu Phe Thr Glu Lys Asp Pro Ser 

-20 ' -15 -10 



Leu Leu Leu Arg Arg Ala Arg Gin Ala Gly ser lie Ala Ser Glu Val 
-5 .-11 5 

Page 26 




Seamus.ST25 

Gly Pro Ser Arg Glu His Asp Gly Pro Glu ser Asn Gin Cys ser 

15 20 25 

Leu His Pro Phe Gin val ser Phe Gin Gin Leu Gly Trp Asp His Trp 

30 35 40 

* 

lie lie Ala Pro His Leu Tyr Thr Pro Asn Tyr cys Lys Gly val Cys 

45 50 55 

Pro Arg Val Leu His Tyr Gly Leu Asn ser Pro Asn His Ala lie lie 
60 65 70 

4 " 

Gin Asn Leu Val ser Glu Leu Val Asp Gin Asn Val Pro Gin Pro Ser 
75 80 85 

cys val Pro Tyr Lys Tyr val Pro lie lie lie Leu Leu lie Glu Ala 
90 95 100 105 

Asn Gly Ser He Leu Tyr Lys Glu Tyr Glu Gly Met lie Ala Gin Ser 

110 115 .120 

Cys Thr Cys Arg 

125 

<210> 15 
<211> 1182 
<212> DNA 

<213> ovis aries 



<220> 

<221> misc_feature 

<222> (1)..(3) 

<223> atg start codon. 

<220> 

* 

<221> mutati on 
<222> (1099) . . (1101) 

<223> g to to at 1100 of [S2]. sheep changes age serine codon to ate i so 
leucine codon 



<220> 
<221> CDS 
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CD . . (1179) 



<220> 

<221> mat_pepti de 
<222> (805) . . O 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(1180) . .(1182) 
tga stop codon 



( 



( 



<400> 15 

atg gtc etc ctg age ate ctt aga ate ctt ctt tgg gga ctg gtg 
Met Val Leu Leu Ser He Leu Arg lie Leu Leu Trp Giy Leu val 

-265 -260 -255 

ctt ttt atg gaa cat agg gtc caa atg aca cag gta ggg cag ccc 
Leu Phe Met Glu His Arg val Gin Met Thr Gin val Gly Gin Pro 

-250 -245 -240 

tct att gee cac ctg cct gag gee cct acc ttg ccc ctg att cag 
Ser lie Ala His Leu Pro Glu Ala Pro Thr Leu Pro Leu lie Gin 

-235 -230 -225 

gag ctg eta gaa gaa gee cct ggc aag cag cag agg aag ccg egg 
Glu Leu Leu Glu Glu Ala Pro Gly Lys Gin Gin Arg Lys pro Arg 

-220 -215 -210 

gtc tta ggg cat ccc tta egg tat atg ctg gag ctg tac cag cgt 
val Leu Gly His Pro Leu Arg Tyr Met Leu Glu Leu Tyr Gin Arg 

-205 -200 -195 

tea get gac gca agt gga cac cct agg gaa aac cgc acc att ggg 
ser Ala Asp Ala ser Gly His Pro Arg Glu Asn Arg Thr lie Gly 

-190 '-185 -180 



gee acc atg gtg agg 
Ala Thr Met Val Arg 

-175 

cct etc aga ggc tec 
Pro Leu Arg Gly ser 

-160 

aga cca aac egg gta 
Arg Pro Asn Arg Val 

-145 

tac cgc cat cag ctt 
Tyr Arg His Gin Leu 

-130 

gtg gag ccc tgg gtc 
Val Glu Pro Trp val 

-115 



ctg gtg agg ccg ctg get agt gta gca agg 
Leu val Arg Pro Leu Ala ser Val Ala Arg 

-170 -165 

tgg cac ata cag acc ctg gac ttt cct ctg 
Trp His lie Gin Thr Leu Asp Phe Pro Leu 

-155 -150 

gca tac caa eta gtc aga gee act gtg gtt 
Ala Tyr Gin Leu Val Arg Ala Thr Val val 

-140 -135 



cac eta act cat tec cac etc tec tgc cat 
His Leu Thr His ser His Leu Ser Cys His 

-125 -120 

cag aaa age cca acc aat cac ttt cct tct 
Gin Lys ser Pro Thr Asn His Phe Pro Ser 

-110 -105 



tea gga aga ggc tec tea aag cct tec ctg. ttg ccc aaa act tgg aca 
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45 



90 



135 



180 



225 



270 



315 



360 



405 



450 



495 



543 




( 



cag. aac ctt gtc agt gag ctg gtg gat cag aat gtc cct cag cct tec 
Gin Asn Leu val ser Glu Leu val Asp Gin Asn val Pro Gin Pro ser 
(. 75 80 85 



591 
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Gly Arg Gly Ser Ser Lys Pro Ser Leu Leu Pro Lys Thr Trp Thr 
-100 -95 -90 

gag atg gat ate atg gaa cat gtt ggg caa aag etc tgg aat cac aag 
Glu Met Asp lie Met Glu His Val Gly Gin Lys Leu Trp Asn His Lys 
-85 -80 -75 

ggg cgc agg gtt eta cga etc cgc ttc gtg tgt cag cag cca aga ggt 639 
Gly Arg Arg Val Leu Arg Leu Arg Phe val Cys Gin Gin Pro Arg Gly 
-70 -65 -60 

agt gag gtt ctt gag ttc tgg tgg cat ggc act tea tea ttg gac act 687 
ser Glu Val Leu Glu Phe Trp Trp His Gly Thr ser ser Leu Asp Thr 
-55 -50 -45 -40 

gtc ttc ttg tta ctg tat ttc aat gac act cag agt gtt cag aag ace 735 
Val Phe Leu Leu Leu Tyr Phe Asn Asp Thr Gin Ser Val Gin Lys Thr 

-35 -30 -25 

aaa cct etc cct aaa ggc ctg aaa gag ttt aca gaa aaa gac cct tct 783 
Lys Pro Leu Pro Lys Gly Leu Lys Glu Phe Thr Glu Lys Asp Pro Ser 

-20 -15 -10 

ctt etc ttg agg agg get cgt caa gca ggc agt att gca teg gaa gtt 831 
Leu Leu Leu Arg Arg Ala Arg Gin Ala Gly Ser lie Ala Ser Glu Val 
-5 -11 5 

cct ggc ccc tec agg gag cat gat ggg cct gaa agt aac cag tgt tec 
Pro Gly Pro Ser Arg Glu His Asp Gly Pro Glu Ser Asn Gin cys Ser 
10 15 20 25 

etc cac cct ttt caa gtc age ttc cag cag ctg ggc tgg gat cac tgg 927 
Leu His Pro Phe Gin Val ser Phe Gin Gin Leu Gly Trp Asp His Trp 

30 35 40 

« * 

ate att get ccc cat etc tat ace cca aac tac tgt aag gga gta tgt 

lie lie Ala Pro His Leu Tyr Thr Pro Asn Tyr cys Lys Gly Val Cys 

45 50 55 



879 



975 



cct egg gta eta cac tat ggt etc aat tct ccc aat cat gee ate ate 1023 
Pro Arg val Leu His Tyr Gly Leu Asn ser Pro Asn His Ala lie lie 
60 65 70 



1071 



tgt gtc cct tat aag tat gtt ccc att ate ate ctt ctg att gag gca 1119 
Cys val Pro Tyr Lys Tyr Val Pro lie lie lie Leu Leu lie Glu Ala 
90 95 100 105 

aat ggg agt ate ttg tac aag gag tat gag ggt atg att gee cag tec 1167 
Asn Gly ser lie Leu Tyr Lys Glu Tyr Glu Gly Met lie Ala Gin Ser 

110 115 120 

tgc aca tgc agg tga 1182 
cys Thr cys Arg 

125 

<210> 16 
<211> 393 
<212> PRT 

<213> ovis aries 

» 
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<221> misc_feature 
<222> (1)..(3) 
<223> atg start codon. 
<220> 

<221> misc_feature 
<222> CU80) . . (1182) 
<223> tga stop codon. 
<400> 16 

Met val Leu Leu ser lie Leu Arg lie Leu Leu Trp Gly Leu val 

-265 -260 -255 

Leu Phe Met Glu His Arg Val Gin Met Thr Gin Val Gly Gin Pro 

-250 -245 -240 

Ser lie Ala His Leu Pro Glu Ala Pro Thr Leu Pro Leu lie Gin 

-235 -230 -225 

Glu Leu Leu Glu Glu Ala Pro Gly Lys Gin Gin Arg Lys Pro Arg 

-220 .-215 -210 

val Leu Gly His Pro Leu Arg Tyr Met Leu Glu Leu Tyr Gin Arg 

-205 -200 -195 

Ser Ala Asp Ala ser Gly His Pro Arg Glu Asn Arg Thr lie Gly 

-190 -185 -180 

Ala Thr Met Val Arg Leu Val Arg Pro Leu Ala Ser Val Ala Arg 

-175 -170 -165 

Pro Leu Arg Gly Ser Trp His He Gin Thr Leu Asp Phe Pro Leu 

, -160 -155 -150 

4 

Arg pro Asn Arg Val Ala Tyr Gin Leu Val Arg Ala Thr val val 

-145 -140 -135 

Tyr Arg His Gin Leu His Leu Thr His ser His Leu Ser cys His 

-130 -125 -120 

Val Glu Pro Trp Val Gin Lys Ser Pro Thr Asn His Phe Pro Ser 

-115 -110 -105 

ser Gly Arg Gly Ser Ser Lys Pro Ser Leu Leu Pro Lys Thr Trp Th 

-100 -95 -90 
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Met Asp lie Met Glu His val Gly Gin Lys Leu Trp Asn His Lys 
-85 -80 -75 

Gly Arg Arg val Leu Arg Leu Arg Phe Val cys Gin Gin Pro Arg Gly 
-70 -65 . -60 

Ser Glu Val Leu Glu Phe Trp Trp His Gly Thr ser ser Leu Asp Thr 
-55 -50 -45 -40 

val Phe Leu Leu Leu Tyr Phe Asn Asp Thr Gin Ser val Gin Lys Thr 

-35 -30 -25 

Lys Pro Leu Pro Lys Gly Leu Lys Glu Phe Thr Glu Lys Asp Pro Ser 

-20 -15 -10 

» 

i 

Leu Leu Leu Arg Arg Ala Arg Gin Ala Gly Ser lie Ala ser Glu Val 
-5 -11 5 

Pro Gly Pro Ser Arg Glu His Asp Gly Pro Glu. ser Asn Gin Cys Ser 
10 15 20 25 

Leu His Pro Phe Gin Val Ser Phe Gin Gin Leu Gly Trp Asp His Trp 

30 35 40 

* 

lie lie Ala Pro His Leu Tyr Thr Pro Asn Tyr Cys Lys Gly Val Cys 

.45 .50 55 

Pro Arg Val Leu His Tyr Gly Leu Asn ser Pro Asn His Ala lie lie 
60 65 70 

Gin Asn Leu val ser Glu Leu val Asp Gin Asn Val Pro Glri Pro ser 
75 80 85 

» 

Cys val Pro Tyr Lys Tyr Val Pro lie lie lie Leu Leu lie Glu Ala 
90 95 100 105 

Asn Gly Ser lie Leu Tyr Lys Glu Tyr Glu Gly Met lie Ala Gin ser 

110 115 120 

Cys Thr Cys Arg 

125 

<210> 17 
<211> 168 
<212> DNA 

<213> ovis aries 



<220> 

Page 31 




seamus . ST25 

CDS 



> CD • • (168) 



<220> 

<221> mutation 
<222> (85) . . (87) 

<223> g to t at 86 of GDF9B sheep changes age serine codon to ate isole 
ucine codon 



<400> 17 

gee ate ate cag aac ctt gtc agt gag ctg gtq gat cag aat gtc cct 48 

Ala lie lie Gin Asn Leu val ser Glu Leu Val Asp Gin Asn val Pro 

15 10 15 

( cag cct tec tgt gtc cct tat aag tat gtt ccc att ate .ate ctt ctg 96 
Gin Pro ser cys val Pro Tyr Lys Tyr val pro lie lie lie Leu Leu 

20 25 30 

att gag gca aat gqg agt ate ttg tac aag gag tat gag gqt atg att 144 
lie Glu Ala Asn Giy ser lie Leu Tyr Lys Glu Tyr Glu Gly Met lie 
35 40 45 

gee cag tec tgc aca tgc agg tga 168 
Ala Gin ser cys Thr Cys Arg 
50 55 

<210> 18 

<211> 55 

<212> PRT 

<213> ovis aries 



<400> 18 

Ala He He Gin Asn Leu Val Ser Glu Leu Val Asp Gin Asn Val Pro 
1 5 .10 15 

Gin pro ser cys val Pro Tyr Lys Tyr Val Pro lie lie lie Leu Leu 

20 25 30 

lie Glu Ala Asn Gly ser lie Leu Tyr Lys Glu Tyr Glu Gly Met lie 
35 40 45 

Ala Gin Ser cys Thr cys Arg 
50 55 
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GURE 1: Irish Cambridge and F700 Belclare sheep GDF9 sequence and mutations. 



B. Nucleotide substitutions of the four GDF9 mutations which change an amino acid 
compared with wild-type sheep sequence. 



ACATGTGC G G A A G AC CAGCT 

i 

Wildtype (GAA) ^ , 




G A AG C CTCTGGJTCCAGC TTC 

Wildtype (GTT) 




ACATGTGC G A A A G AC C A G C T I 

324 (AAA) | 



GAAGCCT CTGNT TC CAGCT TC. 

597 (ATT) 





CAAC TGGATTGTGGC CCCAC 

Wildtype (GTG) | 




G GTATG GC TTTC C GGTTC A' 
Wildtype (TTT) | 



.CAACT GGA2 TAT GGCCCCAC, 



714 (ATG 




G GTATG GC TCTCC G GTTCA 
CTCT) I 



787 




GURE 1: Irish Cambridge and F700 Belclare sheep GDF9 sequence and mutations. 

. Predicted amino acid sequence of sheep GDF9 protein. Numbers at the start of each line 
indicate amino acid positions of fuU-length unprocessed protein. Numbers in brackets 
indicate amino acid positions of the mature peptide. The RRHR furin protease cleavage site 
and predicted start of the mature processed peptide is shaded in grey. The filled triangle 
indicates the position of the single intron within the gene. The open triangles indicate 
positions of mutations that confer amino acid substitutions but are not associated with the 
sterility phenotype. The position of the [787] mutation associated with sterility is shaded 
black. 



1 MALPNKFFLW FCCFAWLCFP ISLDSLPSRG EAQIVARTAL ESEAETWSLL. 

R87H [El] 
V 

51 NHLGGRHRPG LLSPLLEVDY DGHGEPPRLQ PDDRALRYMK RLYKAYATKE 
101 GTPKSNRRHL YNTVRLFTPC AQHKQAPGDL AAGTFPSVDL LFNLDRVTVV 
151 EHLFKSVLLY TFNNSISFPF PVKCICNLVI KEPEFSSKTL PRAPYSFTYN 

E241K[324] 

201 SQFEFRKKYK VJMEI DVTAPL EPLVASHKRN IHMSVNFTCA EDQLQHPSAR 

* 

251 DSLFNMTIiLV APSLLLYLND TSAQAFHRWH SLHPKRKPSQ GPDQKRGLSA 

(1) V332I [597] 



3 01 YPVGEEAAEG VRSSSMDQ ESASSELKKP LVPASVNLSE YFKQFLFPQN 



V371M [714] 
V 



S395F [7873 



3 51 EC ELHDFRL S FSQLKWDNWI VAPHKYNPRY CKGDCPRAVG HRYGgPVHTM 



401 VQNI IHEKLD SSVPRPSCVP AKYSPLSVLA IEPDGSIAYK EYEDMIATKC 
(135) 



GURE 2: Irish Cambridge and F700 Belclare sheep GDF9B sequence and mutations. 



Nucleotide substitutions of the two GDF9B mutations which change an amino acid 
compared with wild-type sheep sequence. 



CAATGACACTCAGAGTGTTC 
Wildtype (CAG) I 

CAATGACAC T N A G AGTGTTC 
SI (TAG) 




GTTCC CATTAGCA TCCTTCT 
Wildtype (AGC)| 

GTTCCCATTATCATCCTTCT 
S2 (ATC) I . 



GURE 2: Irish Cambridge and F700 Belclare sheep GDF9B sequence and mutations 



Predicted amino acid sequence of sheep GDF9B protein. Numbers at the start of each line 
indicate amino acid positions of full-length unprocessed protein. Numbers in brackets 
indicate amino acid positions of the mature peptide. The RRAR furin protease cleavage site 
and predicted start of the mature processed peptide is shaded in grey. The filled triangle 
indicates the position of the single intron within the gene. The open triangle indicates the 
position of a single Leu deletion polymorphism. The position of the [SI] and [S2] mutations 
associated with sterility are shaded black. 



1 MVLLSILRIL LWGLVLFMEH RVQMTQVGQP SIAHLPEAPT LPLIQELLEE 
51 APGKQQRKPR VLGHPLRYML ELYQRSADAS GHPRENRTIG ATMVRLVRPL 
101 ASVARPLRGS WHIQTLDFPL RPNRVAYQLiV RATWYRHQL HLTHSHLSCH 
151 VEPWVQKSPT NHFPSSGRGS SKPSLLPKTW TEMDIMEfJVG QKLWNHKGRR 



Q239Ter {Sit 



2Q1 VLRLRFVCQQ PRGSEVLEFW WHGTSSLDTV FLLLYFNDTgj SVQKTKPLPK 

(1) 

2 51 GLKEFTEKDP SLLL^SQA GSIASEVPGP SREHDGPESN QCSLHPFQVS 



3 01 FQQLGWDHWI IAPHLYTPNY CKGVCPRVLH YGLNSPNHAI IQNLVSELVD 



S367I [S2J 



(125) 



351 QNVPQPSCVP YKYVPlgllilj IEANGSILYK EYEGMIAQSC TCR 




GXJKE 3: Schematic representation of genotypes within F700 Belclare and Cambridge 

K _ 



grees 



a F700 Belclare pedigree. The pedigree represents sire R830 mated to three ewes 9704, 
8783 and 7810, and their six female offspring. The table below the pedigree diagram shows 
the phenotype and genotypes for each of the animals. The phenotype of each animal is 
indicated as sterile S, fertile F or n/a (not applicable) for the male. Dashed lines within the 
circles (females) or squares (males) denote carriers of the GDF9B [SI] mutation, diagonal 
lines denote carriers of the GDF9B [S2] mutation and the solid grey denotes carriers of the 
GDF9 [787] mutation (T nucleotide). Half-cirles or half-squares denote those animals 
heterozygous for a mutation. Sire R830 is heterozygous, for [787] (grey) but hemizygous for 
[S2] as GDF9B is carried on the X chromosome. 




{JURE 3: Schematic representation of genotypes within F700 Belclare and Cambridge 
"igrees 



b. Cambridge pedigrees. The pedigree represents sire 962101 mated to two ewes 962158 
and 976234, and their four female offspring, and sire 930142 mated to ewe 8874 and their one 
female offspring. The table below the pedigree diagram shows the phenotype and genotypes 
for each of the animals. The phenotype of each animal is indicated as sterile S, fertile F or n/a 
(not applicable) for the male. Dashed lines within the circles (females) or squares (males) 
denote carriers of the GDF9B [SI] mutation and the solid grey denotes carriers of the GDF9 
[787] mutation (T nucleotide). Dashed grey squares and circles denote carriers of both 
mutations. Half-and-half shading denotes those animals heterozygous for the mutations. Sire 
962101 is heterozygous for [787] (grey) but hemizygous for [S2] as GDF9B is carried on the 
X chromosome. 
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FIGURE 4: Nucleotide arid amino acid of wildtype sheep GDF9 showing 
positions of mutations in Irish Cambridge and F700 Belclare sheep 

* 

Numbers on the right indicate nucleotide position from the atg start codon. Numbers under 
each line indicate amino acid residue numbering from the start of the mature processed active 
peptide. Negative numbers indicate amino acids in the pro-region of the protein. The position 
of the single intron is marked by white boxes inserted into the sequence. Positions of the eight 
nucleotide polymorphisms are marked in bold within boxes, and named according to Table 1 
using square brackets [ ]. The amino acids residues which are changed by the nucleotide 
polymorphisms are underlined. The taa stop codon indicates the end of the protein. 

Sheep GDF9 

atg gcg ctt ccc aac aaa ttc ttc ctt tgg ttt tgc tgc ttt gcc 45 
Met Ala Leu Pro Asn Lys Phe Phe Leu Trp Phe Cys Cys Phe Ala 

-315 -310 -305 

tgg etc tgt ttt cct att age ctt gat tct ctg cct tct agg gga 90 
Trp Leu Cys Phe Pro lie Ser Leu Asp Ser Leu Pro Ser Arg Gly * 

-300 -295 -290 

gaa get cag att gta get agg act gcg ttg gaa tct gag get gag 135 
Glu Ala Gin lie Val Ala Arg Thr Ala Leu Glu Ser Glu Ala Glu 

-285 -280 -275 



act tgg tec ttg ctg aac cat tta ggt ggg aga cac aga cct ggt 
Thr Trp Ser Leu Leu Asn His Leu Gly Gly Arg His Arg Pro Gly 

-270 -265 -260 



180 



etc ctt tec cct etc tta gag gtt ctg tat gat ggg cac ggg' gaa 22 5 

Leu Leu Ser Pro Leu Leu Glu Val Leu Tyr Asp Gly His Gly Glu 

-255 -250 -245 

[E1J 

ccc ccc agg ctg cag cca gat 'gac aga get ttg c[g]c tac atg aag 27 0 

Pro Pro Arg Leu .Gin Pro Asp Asp Arg Ala Leu Arg Tyr Met Lys 

-240 -235 -230 

agg etc tat aag gca tac get acc aag gag ggg acc cct aaa tec 315 
Arg Leu Tyr Lys Ala Tyr Ala Thr Lys Glu Gly Thr Pro Lys Ser 

-22'5 " -220 -215 

aac aga cgc cac etc tac aac act gtt egg etc ttc acc ccc tgt 360 
Asn Arg Arg His ' Leu Tyr Asn Thr Val Arg Leu Phe Thr Pro Cys 

-210 -205 -200 

Intron position 

get cag cac aag cag get cct ggg gac ctg gcg gca gQga acc ttt 405 
Ala Gin His Lys Gin Ala Pro Gly Asp Leu Ala Ala GQly Thr Phe 

-195 -190 -185 

cca tea gtg gat ctg ctg ttt aac ctg gat cgt gtt act gtt gtg 450 
Pro Ser Val Asp Leu Leu Phe Asn Leu Asp Arg Val Thr Val Val 

-180 -175 -170 

[74] E80] 

gaa cat tta ttc aag tea gtjc] ttg ct|g) tat act ttc aac aac tec 495 
Glu His Leu Phe Lys Ser Val Leu Leu Tyr Thr Phe Asn Asn Ser 

-165 -160 -155 

att tct ttt ccc ttt cct gtt aaa tgt ata tgc aac ctg gtg ata 540 
lie Ser Phe Pro Phe Pro Val Lys Cys lie Cys Asn Leu Val lie 

-150 -145 -140 

aaa gag cca gag ttt tct age aag act etc cct aga get cca tac 585 
Lys Glu Pro Glu Phe Ser Ser Lys Thr Leu Pro Arg Ala Pro Tyr 

-135 ~ -130 -125 



tea ttt acc tat aac tea cag ttt gaa ttt aga aag aaa tac aaa 



630 



Ser Phe Thr Tyr Asn Ser Gin Phe Glu - Phe Arg Lys Lys Tyr Lys 

-120 -115 -110 

tgg atg gag att gab gtg acg get cct ctt gag cct ctg gtg gec tec 678 
Trp Met Glu He Asp Val Thr Ala Pro Leu Glu Pro Leu Val Ala Ser 

-105 -100 -95 

[3241 

cac aag agg aat att cac atg tct gta aat ttt aca tgt gcg |g]aa gac 726 
His Lys Arg Asn He His -Met Ser Val Asn Phe Thr Cys Ala Glu Asp 
-90 -85 -80 

cag ctg cag cat cct tea gcg egg gac age ctg ttt aac atg act ctt 774 
Gin Leu Gin His Pro Ser Ala Arg Asp Ser Leu Phe Asn Met Thr Leu 
-75 -70 -65 

etc gta gcg ccc tea ctg ctt ttg tat ctg aac gac aca agt get cag 822 
Leu Val Ala Pro Ser Leu Leu Leu Tyr Leu Asn Asp Thr Ser Ala Gin 
-60 -55 • -50 -45 

get ttt cac agg tgg cat tec etc cac cct aaa agg aag cct tea cag 870 
Ala Phe His Arg Trp His Ser Leu His Pro Lys Arg Lys Pro Ser Gin 

-40 . -35 -30 

ggt cct gac cag aag aga ggg eta tct gec tac ccc gtg gga gaa gaa 918 
Gly Pro Asp Gin Lys Arg Gly Leu Ser Ala Tyr Pro Val Gly Glu Glu 

-25 -20 -15 

get get gag ggt gta aga teg tec cgt cac cgc aga gac cag gag agt 96.6 
Ala Ala Glu Gly Val Arg Ser Ser Arg His Arg Arg Asp Gin Glu Ser 
-10 -5 -11. 

. [5813 [597] 
gec age tct ga|a| ttg aag aag cct ctg |g|tt cca get tea gtc aat ctg 1014 
• Ala Ser Ser Glu Leu Lys Lys Pro Leu Val Pro Ala Ser Val Asn Leu 
5 10 15 20 

agt gaa tac ttc aaa cag ttt ctt ttt ccc cag aat gaa tgt" gag etc • 1062 
Ser Glu Tyr Phe Lys Gin Phe Leu Phe Pro Gin Asn Glu Cys Glu Leu 

25 30 35 

cat gac ttt aga ctt age ttt agt cag ctg aag tgg gac aac tgg att 1110 
His Asp Phe Arg Leu Ser Phe Ser Gin Leu Lys Trp Asp Asn Trp He 

40 45 50 

[7143 

[gjtg gee cca cac aaa tac aac cct cga tac tgt aaa ggg gac tgt ccc 115 8 

Val Ala Pro His Lys Tyr Asn Pro Arg Tyr Cys Lys Gly Asp Cys Pro 
55 60 65 

[7873 

agg gcg gtc gga cat egg tat gge tgt ceg gtt cac acc atg gtg cag 12 06 

Arg Ala Val Gly His Arg Tyr Gly Ser Pro Val His Thr Met Val Gin 
70 75 80 

aac ate ate cat gag aaa ctt gac tec tea gtg cca aga cca tec tgt 1254 
Asn He He His Glu Lys Leu Asp Ser Ser Val Pro Arg Pro Ser Cys 
85 90 95 100 

gta cct gee aag tat age cct ttg agt gtt ttg gec ate gag cct gat 13 02 

Val Pro Ala Lys Tyr Ser Pro Leu Ser Val Leu Ala He Glu Pro Asp 

105 110 ' 115 

ggc tea ate get tat aaa gaa tat gaa gat atg ata gec act aag tgt 13 50 

Gly Ser He Ala Tyr Lys Glu Tyr Glu Asp Met He Ala Thr Lys Cys 

120 125 130 

acc tgt cgt taa cagactc ctgtcaagta aaaccatgag tgtcctggcc 1399 
Thr Cys Arg STOP 
135 



agtgtaaatg ccgcgcc 



1416 



FIGURE 5: Nucleotide and amino acid of wildtype sheep GDF9B showing 
positions of mutations in Irish Cambridge and F700 Belclare sheep 

Numbers on the right indicate nucleotide position from the atg start codon. Numbers 
under each line indicate amino acid residue numbering from the start of the mature 
processed active peptide. Negative numbers indicate amino acids in the pro-region of 
the protein. The position of the single intron is marked by a dashed inserted into the 
sequence. Positions of the four nucleotide polymorphisms are marked in bold within 
boxes, and named according to Table 1 using square brackets [ ]. The amino acids 
residues which are changed by the nucleotide polymorphisms are underlined. 
Asterisks (***) indicate the positions of the previously reported Hanna (Gin to Stop 
codon) and Inverdale (Val to Asp codon) mutations. The tga stop codon indicates the 
end of the protein. 

Sheep BMP15 full 



atg gtc etc ctg age ate ctt aga ate 27 

Met Val Leu Leu Ser He Leu Arg He 

-265 -260 

[Leu] deletion 

cttJ ctt tgg gga ctg gtg ctt ttt atg gaa cat agg gtc caa atg 72 



Leu Leu Trp Gly Leu Val Leu Phe Met Glu His Arg Val Gin Met . 

-255 -250 -245 

aca cag gta ggg cag ccc tct att gec cac ctg cct gag gee cct 117 
Thr Gin Val Gly Gin Pro Ser He Ala His Leu Pro Glu Ala Pro 

-240 -235 -230 

acc ttg ccc ctg att cag gag ctg eta gaa gaa gee cct ggc aag 162 
Thr Leu Pro Leu He Gin Glu Leu Leu Glu Glu Ala Pro Gly Lys 

-225 -220 -215 

cag cag agg aag ccg egg gtc tta ggg cat ccc tta egg tat atg 207 
Gin Gin Arg Lys Pro Arg Val Leu Gly His Pro Leu Arg Tyr Met 

-210 -205 ' -200 

ctg gag ctg tac cag cgt tea get gac gca agt gga cac cct agg 252 
Leu Glu Leu Tyr Gin Arg Ser Ala Asp Ala Ser Gly His Pro Arg 

-195 -190 -185 

gaa aac cgc acc att ggg gee acc atg gtg agg ctg gtg agg ccg 297 
Glu Asn Arg Thr He Gly Ala Thr Met Val Arg Leu Val Arg Pro 

-180 -175 -170 

Intron position 

ctg get agt gta gca agg cct etc aga g gc 327 

Leu Ala Ser Val Ala Arg Pro Leu Arg Gly 

-165 -160 

tec tgg cac ata cag acc ctg gac ttt cct ctg aga cca aac egg 372 
Ser Trp His He Gin Thr Leu Asp Phe Pro Leu Arg Pro Asn Arg 

-155 -150 -145 

gta- gca tac caa eta gtc aga gec act gtg gtt tac cgc cat cag 417 
Val Ala Tyr Gin Leu Val Arg Ala Thr Val Val Tyr Arg His Gin 

-140 -135 -130 

ctt cac eta act cat tec cac etc tec tgc cat gtg gag ccc tgg 462 
Leu His Leu Thr His Ser His Leu Ser Cys His Val Glu Pro Trp 

-125 -120 -115 



gtc cag aaa age cca acc aat cac ttt cct tct tea gga aga ggc 



507 



Val Gin Lys Ser Pro Thr Asri His Phe Pro 'Ser Ser Gly Arg Gly 

-110 -105 -100 

tec tea aag cct tec ctg ttg ccc aaa act tgg aca gag atg gat ate 
Ser Ser Lys Pro Ser Leu Leu Pro Lys. Thr Trp Thr Glu Met Asp lie 

-95 -90 -85 

atg gaa cat gtt ggg caa aag etc tgg aat cac aag ggg cgc agg gtt 
Met Glu His Val Gly Gin Lys Leu Trp Asn His Lys Gly Arg Arg Val 

-80 r75 -70 

eta cga etc cgc ttc gtg tgt cag cag cca aga ggt agt gag gtt ctt 
Leu Arg Leu Arg Phe Val Cys Gin Gin Pro Arg Gly Ser Glu Val Leu 
-65 -60 -55 

gag ttc tgg tgg cat ggc act tea tea ttg gac act gtc' ttc ttg tta 
Glu Phe Trp Trp His Gly Thr Ser Ser Leu Asp Thr Val Phe Leu Leu 
-50 -45 -40 

£S1] [422] 

ctg tat ttc aat gac act @ag agt gtt cag aag ace aaa cct etc ccjt] 
Leu Tyr Phe Asn Asp Thr Gin Ser Val Gin Lys Thr Lys Pro Leu Pro 
-35 -30 ■ . -25 -20 

* 

aaa ggc ctg aaa gag ttt aca gaa aaa gac cct tct ctt etc ttg agg 

Lys Gly Leu Lys Glu Phe Thr Glu Lys Asp Pro Ser Leu Leu Leu Arg 

-15 -10 ' -5 

agg get cgt caa gca ggc agt att gca teg gaa gtt cct ggc 'ccc tec 
Arg Ala Arg Gin Ala Gly Ser He Ala Ser Glu Val Pro Gly Pro Ser 
-11 5 . 10 

agg gag cat gat ggg cct gaa agt aac cag tgt tec etc cac cct ttt 
Arg Glu His Asp Gly Pro Glu Ser Asn Gin Cys Ser Leu His Pro Phe 
~ 15 20 *** 25 

caa gtc age ttc cag cag ctg ggc tgg gat cac tgg ate att get ccc 
Gin Val Ser Phe Gin Gin Leu Gly Trp Asp His Trp He He Ala Pro 
30 *** 35 40 45 

cat etc tat ace cca aac tac tgt aag gga gta tgt cct egg gta eta 
His Leu Tyr Thr Pro Asn Tyr Cys Lys Gly Val Cys Pro. Arg Val Leu 

50 " 55 60 

cac tat ggt etc aat tct ccc aat cat gee ate ate cag aac ctt gtc 
His Tyr Gly Leu Asn Ser Pro Asn. His Ala He He Gin Asn Leu Val 

65 70 75 

agt gag ctg gtg gat cag aat gtc cct cag cct tec tgt gtc cct tat 
Ser Glu Leu Val Asp Gin Asn Val Pro Gin Pro Ser Cys Val Pro Tyr 
80 85 90 

[S2] 

aag tat gtt ccc att age ate ctt ctg att gag gca aat ggg agt ate 
Lys Tyr Val Pro He Ser He Leu Leu He Glu Ala Asn Gly Ser He 
95 100 105 

ttg tac aag gag tat gag ggt atg att gee cag tec tgc aca tgc agg 
Leu Tyr Lys Glu Tyr Glu Gly Met He Ala Gin Ser Cys Thr Cys Arg 
110 115 120 125 



tga eggcaaaggtgea 
STOP 



JHjfiTJRE 6: Alignment of GDF9 and GDF9B protein sequence with other members of the 
^^SSP superfamily members for which structures have been determined 

The farin processing site is indicated as a solid gray block at the start if the sequences. The mature 
processed protein begins at amino acid residue position 4. Conserved cystein molecules involved in 
disulphide bonds are shown in grey shading. Numbers along the bottom provide a relative reference 
to amino acid position, but do not represent the real amino acid residue number of each protein 
because gaps have been introduce to allow alignment, of conserved protein regions. The asterisk * 
indicates the conserved systeine that is present in all other TGFjJ family members except GDF9 and 
GDF9B, and which is responsible for the interchain disulphide bond present in most dimers. Boxed 
letters indicate the [787] serine (S) in GDF9 which is changed to phenylalanine in the mutants 
(position 86 on this diagram), and the [S2] serine (S) in GDF9B which is changed to isoleucine 
(position 118 in this diagram). Bold letters above the sequences refer to the conserved histidine 
involved in dimer hydrogen binding (H) in BMP7 and TGFJ33, and the conserved serine and 
leucine residues required for receptor binding (S, L) in BMP2. 

BMP 2 Human ' |^E^RQAKHKQRKRLKS SpKRHPLYVDF S - DVGWND 

BMP7 /OP1 Human j^£gjSTGSKQ 

GDF9 SHEEP 3^||toQESASSEIiKKPIiVPASVOTiSEYFKQFIiFP QNE&EI1HDFRI1SFS- QLKWDN 

' GDF9 Human . ^URGQETVSSEIiKKPLGPASFNLSEYFRQFLLP QNlsSELHDFRL SF S — QLKWDN 

GDF9 Mouse |^^R!GQKAIRSEAKGPLLTASFNLSEyFKQFIjFP : QNE^ELHDFRIiSFS- QLKWDN 

GDF9B SHEEP rI^QAGSIASEVPGPSREHDGPE SNQGSIiHPFQVSFQ— QLGWDH 

GDF9B Mouse ^^CqACSIESDASCPSQEHDGSV NNQ^SLHPYKVSFH— QLGWDH 

GDF9B Human |&^QADGISAEOTASSSKHSGPE NNQJESLHPFQI SFR— QLGWDH 

TGFB2 Human p^^ALDAAYCFRNVQDN j§CLRPLYIDFKRDLGWK- 

TGFB3 Human pp^ALDTNYCFRNLEEN jCCVRPLYIDFRQDLGWK- 

TGFB1 Human eMrALDTNYCFSSTEKN fe CVRQLYIDFRKDLGWK- 

0 10 20 30 40 50 



BMP 2 Human 
BMP7/OP1 Human 
GDF9 SHEEP 
GDF9 Human 
GDF9 Mouse 
GDF9B SHEEP 
GDF9B Mouse 
GDF9B Human 
TGFB2 Human ■ 
TGFB3 Human " 
TGFB1 Human 



H S 

WI VAPPGYHAFYjjj|HGEj2;PF PIiADHLNSTNHAIVQTLVNSVN- SKIPKAfeGVPTEL SAX SM 
WI I AP EGYAAYYj^EGEQAFPLNS YMNATNHAI VQTLVHF INPETVPKP J|]|aPTQLNAI SV 
WI VAPHKYNPRYgKGDGPRAVGHRYG^PVHTMTV"QNI IHEKLDS SVPRP S^VPAKYSPLSV 
WI VAPHRYNPRYOKGDpPRAVGHRYGS PVHTMVQNI I YEKLDS SVPRP S^VPAKYS PL S V 
WI VAPHRYNPRY^KGD^PRAVRHRYGS PVHTMVQNI I YEKLDP SVPRP SjSVPGKYS PL SV 
WI I APHLYTPNYgKGVpPRVLHYGLNSPNHAI IQl^VSEriVDQITVPQPsSvPTncyVP igl 
WI I APRLYTPNYGKGIj^RVLPYGLNSPNHAI IQSLVNELVNHSVPQPS^VPYNFLPMS I 

wi iappfytpny|kgt|lrvlrdglnspnhaiiqnlinqlvdqsvprp SjpVP YKYVP I S V 

WIHEPKGYNANF^AGAipYLWS SDTQHSRVLSLYNTINPEASASPglvSQDLEPLTI 

WVHE PKGYYANFGSGPSPYLRS ADTTHSTVLGLYNTLNPEASASP&0VPQDLEPLTI 

WIHEPKGYHANFgLGPBPYIWS LDTQYSKVLALYWQHNPGASAAPggVPQALEPLPI 

60 70 80 90 100 * 110 



BMP 2 Human 
BMP7/0P1 Human 
GDF9 SHEEP 
GDF9 Human 
GDF9 Mouse 
GDF9B SHEEP 
GDF9B Mouse 
GDF9B Human 
TGFB2 Human 
TGFB3 Human 
TGFB1 Human 



LYLDE3STEKVVLKNYQDMVVEC^Gj|R 
LYFDDS SIWILKKYRNMVVRA^GGH 
LAI EPDG S I AYKEYEDMI ATKGTjCR 



LTI EPDGS IAYKEYEDMIATKBTigR 
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_ . _ . , he ;M 



LL I ETNG S I L YKEYEGMI AQ SL . 
LMIEANGS ILYKEYEGMIAESCJTlSR 
LYYIGK-TPKIEQLSNMIVKSj 
LYYVGR-TPKVEQLSNMVVKSpi^S 
VYYVGR-KPKVEQLSHMIVRsf " 
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gure 7. Examples of the pattern of progesterone concentrations in plasma of actively 
immunized ewes. Antigen used for immunization and the ewe identification numbers are shown 
at the top of each graph. Markings by vasectomized rams are indicated with arrows. Day 0 = 
corresponds to the beginning of thrice weekly sampling period. 
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Figure 8. Average concentrations of progesterone in plasma following synchronization 
regression. Ewes were administered 100 ml of KLH, GDF9 peptide or BMP15 peptide 
antiplasma Lv. 4 days before synchronization with Estrumate (i.e. PGF 2a , arrowed). 
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